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1  Summary  of  Objectives  and  Results 

This  research  has  concerned  optimal  and  adaptive  control  and  adaptive  identification  of 
distributed  systems.  Most  of  the  research  has  focused  on  digital  control  and  identification 
methods,  to  allow  for  real-time  implementation. 

Both  new  mathematical  theory  and  new  numerical  methods  have  been  primary  objec¬ 
tives  and  results  of  the  research.  Most  of  the  new  optimal  control  theory  has  concerned 
optimal  control  of  distributed  systems.  The  adaptive  control  methods  had  concentrated  on 
adaptive  disturbance  rejection,  or  noise  cancellation.  The  adaptive  identification  research 
has  contributed  both  novel  mathematical  theory  for  parameter  estimation  and  new  fast 
numerical  algorithms  for  adaptive  filtering. 

The  methods  for  adaptive  identification  and  disturbance  rejection  have  been  demon¬ 
strated  by  experimental  application  at  Wright  Patterson  AFB  and  the  Jet  Propulsion  Lab¬ 
oratory  in  Pasadena,  CA. 

2  Optimal  Control  of  Distributed  Systems 

The  research  on  optimal  control  of  distributed  parameter  systems  covered  two  basic  areas; 
numerical  methods  and  convergence  theory  for  optimal  control  of  flexible  structures  [6,  7,  8), 
and  integrated  design  of  controllers  and  distributed  sensors  for  smart  structures  [3,  10,  18]. 

2.1  Numerical  Methods  and  Convergence  Theory 

The  research  in  numerical  methods  and  convergence  analysis  in  [6,  7,  8]  continued  work 
begun  under  a  previous  AFOSR  grant.  This  work  developed  approximation  methods  for 
design  of  finite-dimensional  compensators  based  on  distributed  models  of  highly  flexible 
structures  such  as  space  antennae  and  satellites  with  large  flexible  solar  arrays.  The  main 
results  of  the  approximation  theory  were  various  necessary  and  sufficient  conditions  for 
a  finite-dimensional  compensator  to  converge  to  an  optimal  compensator  for  the  infinite¬ 
dimensional  model  as  the  or-^.er  of  the  compensator  increases. 

2.2  Integrated  Design  of  Controllers  and  Distributed  Sensors 

The  goal  of  the  research  in  [3,  10,  18]  is  to  guide  the  integrated  design  of  “smart  structures” 
and  low-order  compensators  for  smart  structures.  In  the  approach  taken  in  [6,  7,  8],  the 
order  of  a  near-optimal  compensator  might  be  very  large  for  a  highly  flexible  structure. 
This  drawback  motivated  the  research  in  |3,  10,  18],  which  was  initiated  under  this  grant. 
This  new  approach  leads  to  very  low-order  compensators  (sometimes  of  dimension  1)  when 
certain  distributed  sensors  are  designed  to  measure  the  correct  functionals  of  the  distributed 
state  vector.  While  the  work  in  [3,  10,  18]  emphasizes  fiber  optic  sensors,  the  same  approach 
applies  when  piezo-electric  films  are  used  as  distributed  strain  gauges. 

Whether  fiber-optic  or  piezo-electric  sensors  are  used,  the  methods  developed  in  (3, 
10,  18]  can  be  used  to  design  flexible  components  for  smart  structures,  and  this  was  the 
primary  motivation  for  the  research.  In  this  approach  to  the  design  of  smart  structures, 
the  distributed  sensors  are  embedded  in  the  flexible  beams  and  rods  that  are  connected  to 
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construct  a  large  complex  structure.  Embedding  fiber-optic  and  piezo-electric  sensors  in 
composite  materieds  is  now  a  proven  technology. 


3  Adaptive  Filtering  and  Identification 

The  research  on  adaptive  filtering  and  identification  has  included  derivation  of  fast,  nu¬ 
merically  stable  algorithms  for  real-time  parameter  estimation,  as  well  as  new  theoretical 
methods  for  the  study  of  the  asymptotic  properties  of  {adaptive  parameter  estimates  and 
the  derivation  of  a  new  generation  of  adaptive  filters. 

3.1  Adaptive  Lattice  Filters 

As  fm  as  useful  software  is  concerned,  the  most  important  research  imder  this  grant  was 
the  development  of  new  lattice  filters  [1,  2,  16]  for  adaptive  identification  and  filtering  of 
multichannel  systems.  We  believe  that  the  lattice  filter  in  [2]  is  the  best  on  the  market  for 
systems  with  many  input  and/or  output  channels,  especiaUy  when  the  paraUel  architecture 
developed  in  [2]  is  used. 

Least-squares  lattice  filters  are  fast  algorithms  for  adaptive  identification  of  linear  digital 
input/output  models.  In  addition  to  efficiency,  important  characteristics  of  lattice  filters  are 
numerical  stability,  order-recursiveness  and  suitability  for  parallel  architectures.  Recently, 
an  new  unwindowed,  or  covariance,  lattice  filter  was  introduced  in  [1]  for  solving  the  problem 
of  exact  initialization.  An  unwindowed  lattice  usually  achieves  much  faster  convergence  than 
prewindowed  lattices  when  the  data  has  nonzero  initial  conditions. 

In  estimating  a  process  with  p  channels,  most  lattice  filters,  including  the  lattice  in  [1], 
require  inversion  of  pxp  covariance  matrices.  The  unwindowed  lattice  filter  developed  in  [16] 
and  used  in  this  paper  incorporates  a  Gram-Schmit  procedixre  that  renders  the  covariance 
matrices  diagonal  This  improves  both  the  numerical  stability  and  the  efficiency  of  the 
lattice. 

It  has  been  demonstrated  in  previous  literature  that  lattice  filters  can  be  efl’ective 
for  adaptive  identification  of  flexible  structures,  but  the  previous  literature  used  prewin¬ 
dowed  lattices  and  primarily  single-input/single-output  models  and  data  The  results  in 
[11]  demonstrate  the  advantages  of  an  unwindowed,  multi-channel  lattice  for  identification 
of  complex  structures. 

The  multichaimel  least-squares  lattice  filter  in  [16]  is  used  to  estimate  parameters  for 
an  ARX  (autoregressive  with  exogenous  input)  model 

n  n 

-  *)  =  -  t), 

i=l  *=1 

t  =  0,l,2,...,  (3.1) 

where  y{t)  is  an  p-vector,  u{t)  is  a  m-vector,  and  the  ARMA  coefficients  Ai  and  Bi  are, 
respectively,  pxp  and  p  x  m  matrices. 

It  should  be  noted  that  the  basic  residual-error  lattice  involves  reflection  coefficients 
rather  than  the  ARMA  coefficients  Ai  and  Bi.  The  ARMA  coefficients  are  generated  from 
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the  output  of  the  residual  error  lattice  by  an  auxiliary  algorithm,  which  need  not  be  run  at 
each  time  step. 

One  of  the  most  important  features  of  the  lattice  filter  developed  in  [16]  is  its  suitability 
for  VLSI  realization.  The  forward-propagating  and  backward-propagating  quantities  on 
which  the  new  lattice  is  based  allow  the  lattice  to  be  realized  with  wavefront  arrays,  which 
are  among  the  most  efficient  VLSI  architecture’s.  New  wavefront  arrays  designed  for  the 
imwindowed  lattice  filter  are  presented  in  [16,  2). 

The  lattice  filter  in  [2,  16]  was  used  for  adaptive  identification  of  large  experimental 
aerospace  structures  from  input/output  data  in  [11,  9].  The  paper  [11]  resulted  from  our 
collaboration  with  Wright  Patterson  AFB,  and  [9]  resulted  from  our  collaboration  with  the 
Jet  Propulsion  Laboratory. 

3.2  Identification  of  an  Experimental  Truss 

The  structure  shown  in  Figure  1  is  an  experimental  truss  in  the  Flight  Dynamics  Laboratory 
at  Wright  Patterson  Air  Force  Base.  The  truss  is  12m  high.  Eight  actuator/sensor  pairs 
and  a  disturbance  actuator  are  attached  to  the  truss  as  shown.  Only  the  actuators  and 
sensors  located  on  the  top  of  the  truss  were  used  for  [11]. 

The  results  reported  in  [11]  were  obtained  by  using  the  multichannel  lattice  filter  to 
estimate  the  coefficients  in  (3.1)  adaptively.  To  generate  the  data  used  for  this  paper,  the 
disturbance  actuator  and  actuator  2  excited  the  truss  simultaneously  with  broad-band  force 
sequences.  These  two  input  sequences  and  the  corresponding  output  sequences  from  the 
four  sensors  were  sampled  and  recorded  at  50Hz,  and  the  lattice  filter  was  used  to  fit  the 
data  with  ARMA  models  of  all  orders  n  between  1  eind  40.  The  natural  frequencies  and 
damping  ratios  of  the  truss  are  estimated  by  computing  the  poles  of  the  estimated  ARMA 
model. 

Because  the  geometry  of  the  truss  is  almost  invariant  with  respect  to  90°  rotations,  the 
bending  modes  occur  in  pairs  with  almost  repeated  frequencies.  For  any  one  of  the  four 
sensors,  certain  bending  modes  are  highly  observable  and  certain  other  bending  modes  are 
marginally  observable.  Therefore,  accurate  identification  of  all  of  the  modes  is  possible 
only  by  using  all  four  sensors  simultaneously.  Figure  2  shows  adaptive  frequency  estimates 
obtained  using  all  four  sensors  as  output  channels  with  n  =  30. 

3.3  Adaptive  Minimax  Estimation  and  Filtering 

In  [4,  14,  15],  we  introduced  a  new  class  of  parameter  estimation  problems,  in  which  the  es¬ 
timated  parameters  are  minimizing  solutions  to  minimEoc  problems  for  quadratic  fit-to-data 
criteria.  Whereas  the  asymptotic  parameter  estimates  produced  by  least-squares  methods 
are  Markov  parameters  of  Kalman  filters,  the  asymptotic  parameter  estimates  produced  by 
the  order-recursive  minimax  problem  in  [4]  are  Markov  parameters  of  discrete-time  Hoo  fil¬ 
ters.  We  believe  that  the  ideas  and  results  in  [4]  constitute  the  theoretical  foundation  for  a 
new  generation  of  adaptive  filters  that  will  be  robust  to  disturbances,  unmodeled  dynamics, 
and  parameter  variations  in  the  plant.  We  are  pursuing  these  new  adaptive  filters  in  our 
current  research  sponsored  by  AFOSR 
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4  Adaptive  Control  and  Disturbance  Rejection 

There  are  many  aerospace  and  aeronautica]  applications  of  adaptive  disturbance  rejection, 
or  noise  cancellation.  For  example,  the  optical  instruments  used  in  aerospace  telescopes 
typically  are  mounted  on  flexible  structures  in  which  vibrations  are  excited  by  both  external 
disturbances  and  internal  engines  used  for  attitude  control  and  other  mission  purposes.  The 
optical  instruments  must  be  isolated,  often  actively,  from  such  vibrations  to  maintain  optical 
pathlength  errors  at  the  submicron  levels  required  for  space  telemetry  [9].  Also,  cabin  noise 
generated  by  aircraft  engines  and  aerodynamics  can  be  reduced  significantly  by  active  noise 
cancellation  methods. 

Under  our  current  AFOSR  grant,  we  have  developed  a  new  method  for  adaptive  distur¬ 
bance  rejection,  based  on  a  new  way  of  incorporating  a  disturbance  model  in  an  expanded 
plant  model  (17,  12,  13,  5].  While  the  controller  design  presented  in  (17,  13,  5]  uses  an  inter¬ 
nal  model  of  the  disturbance,  it  diflfers  from  previous  disturbance-rejecting  controllers  based 
on  internal  disturbance  models  because  it  separates  the  design  into  two  parts:  design  of  a 
basic  stabilizing  controller  for  the  plant,  and  design  of  a  disturbance-rejecting  augmentation 
to  the  basic  stabilizing  controller.  The  basic  stabilizing  controller  for  the  plant  is  designed 
independently  of  the  disturbance,  and  the  part  of  the  control  law  that  stabilizes  the  plant 
is  not  filtered  through  the  disturbance  dynamics  as  in  previous  disturbance-rejecting  con¬ 
trollers.  Also,  the  disturbance-rejecting  augmentation  for  the  controller  can  be  redesigned 
efficiently  without  changing  the  basic  stabilizing  controller.  The  ability  to  redesign  the 
disturbance-rejecting  part  of  the  controller  quickly  is  important  for  adaptive  disturbance 
rejection  in  applications  where  the  plant  remains  constant  but  the  disturbance  changes  over 
time 

The  method  that  we  developed  in  (17,  13,  5]  for  adaptive  disturbance  rejection  has 
performed  well  in  simulations  in  (17,  5]  and  in  an  experimental  application  [9],  where  the 
disturbance-rejecting  controller  isolated  a  high-precision  optical  instrument  from  sinusoidal 
disturbances  acting  on  a  flexible  truss  in  the  JPL  Phase  B  Test  Bed  (Figures  3  and  4).  The 
test  bed  was  developed  to  demonstrate  sub- micron  control  for  optical  instruments  to  be  used 
in  future  space  missions.  The  new  disturbance-rejecting  controller  was  used  to  reduce  optical 
path-length  error  when  the  structure  was  subjected  to  sinusoidal  disturbances.  Figure  4 
shows  open-loop  and  closed-loop  path-length  errors. 
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All  other  publications  have  been  provided  separately. 


Optimal  Design  of  Fiber  Optic  Sensors  for  Control  of  Flexible 

Structures  * 

C.-L.  Meng  and  J.S.  Gibson 
Mechanical,  Aerospace,  and  Nuclear  Engineering 
University  of  California,  Los  Angeles  90024-1597 

June  23,  1994 


Abstract 

This  paper  presents  methods  for  designing  fiber  optic  sensors  for  control  of  flexible  struc¬ 
tures.  The  sensors  are  designed  so  that  a  first-order  observer,  instead  of  a  high-order  state 
estimator,  can  be  used  to  construct  feedback  control  laws  based  on  distributed  or  high-order 
finite-dimensional  models  of  the  structure.  Design  methods  for  the  sensors  and  the  observers  are 
developed  for  two  types  of  pr^olems:  first,  where  the  sensors  extend  over  the  entire  structure, 
and  second,  where  the  sensors  extend  over  one  or  more  subregions. 


This  resesFch  was  supported  by  AFOSR  Grant  910016. 


1  Introduction 


Active  control  systems  often  are  required  to  reduce  vibrations  in  flexible  space  structures.  Recent 
research  [2],  [3],  [4],  [5],  [9],  [10],  [11],  [12],  [16],  [17]  has  shown  that  fiber  optic  sensors  can  be 
embedded  in  composite  materials  to  measure  spatial  integrals  of  strdn  along  the  length  of  the  fiber 
and  that  such  measurements  can  be  used  in  feedback  control  systems  to  damp  structural  vibrations 
actively.  Modal  domain  fiber  optic  sensors,  discussed  in  [3],  appear  to  be  best  suited  for  such 
applications.  As  shown  in  [11],  the  output  of  a  fiber  optic  sensor  can  be  represented  by  a  weighted 
spatial  integral  of  strain.  Also  [11]  showed  that,  for  a  single  actuator,  a  firsUorder  dynamic  observer 
can  be  used  in  the  feedback  loop  to  construct  an  arbitrary  linear  control  law  based  on  any  finite 
number  of  structural  modes.  The  first-order  observer  uses  the  output  of  two  fiber  optic  sensors  that 
must  be  tapered  appropriately. 

According  to  distributed-system  theory,  optimal  control  laws  for  flexible  structures  usually  re¬ 
quire  weighted  spatial  integrals  of  strain  and  velocity  [6].  Constructing  such  control  laws  from  the 
measurements  of  local  positions,  velocities,  accelerations,  and  strains  using  conventional  point  sen¬ 
sors  requires  infinite  dimensional  state  estimators.  These  estimators  are  approximated  by  high-order 
finite  dimensional  estimators  in  implementation.  Section  2  of  this  paper  presents  the  design  theory 
for  a  first-order  functional  observer  that  constructs  control  laws  derived  from  distributed  system 
theory.  This  observer  design  in  general  requires  the  sensors  to  be  distributed  over  the  entire  length 
of  the  structure.  Numerical  examples  in  Section  2.4  illustrate  the  theoretical  results  presented  in 
Section  2.1.  The  results  in  Section  2  were  developed  in  [8,  13]. 

In  applications,  it  may  not  be  possible  for  the  fiber  optic  sensors  to  be  distributed  over  the  entire 
igth  of  a  structure.  When  the  fiber  optic  sensors  extend  only  over  portions  of  a  flexible  structure 
there  will  exist  some  error  between  the  output  of  the  first  order  functional  observer  and  the  control 
law  that  the  observer  is  designed  to  construct.  In  this  case,  the  design  problem  is  to  find  the  optimal 
sensor  gains  so  that  the  error  is  minimized  and  the  control  system  can  achieve  high  performance. 
Section  3  discusses  the  design  method  for  fiber  optic  sensors  and  the  corresponding  observers  when 
the  sensors  extend  over  a  portion  of  the  structure.  Section  3.3  presents  the  numerical  examples.  The 
results  in  Section  3  were  developed  in  [13]. 

2  Sensors  Spanning  the  Entire  Structure 

2.1  Plant  Model  and  Energy  Space 

For  the  forced  linear  vibrations  of  flexible  structures,  the  common  abstract  evolution  equation  is 
[1,  6,  7,  15] 

«;(<)  +  Dow(t)  +  Aoti;(0  =  Bou(0-  (2.1) 

The  generalized  displacement  u;(t)  and  Bq  are  in  the  real  Hilbert  space  //.  The  forcing  function  u(t) 

is  a  real  valued  function  of  t.  The  stiffness  operator  Aq  is  a  coercive  self-adjoint  linear  operator  with 
domain  dense  in  H.  The  damping  operator  Do  is  a  nonnegative  linear  operator  bounded  relative  to 

Ao- 

The  natural  strain  energy  space  is  V=Dom(Ao'  )  with  inner  product 

(Aj'^.,Ay’.)ir.  (2.2) 

The  natural  state  space  for  first  order  form  of  (2.1)  is  the  total  energy  space 

E=VxH.  (2.3) 
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2.2  A  First-order  Functional  Observer 

Theorem  1  Let  fi  £V  and  /j  £  H,  u)(t)  be  the  solntion  of  Eq.(2.1)  and  define 

y  =  (/i .  Ht))v  +  {/a,  ii>(t))w. 

Let  ibi  €  and  iba  €  V.  The  two  measnretnents  yi  and  ya  are  defined  as: 

yi(0  = 

ya(0  =  {h,v){t))v- 

For  any  0  €  R,  the  reaUvalued  function  2(()  u  the  solution  of  the  following  equation: 

2(0  = -*(0  +  yi  +  MO- 

Define 

<  =  2(0  + 10  -  y- 

TAen  e  satisfies: 

c  +  c  =  0 

if  and  only  if 

/a€V 

Aa  =  /i  +  (V-V^S)/2 

*1  +  *a  =  /i  -  /a 
0  —  (/a.-Bo)^- 


(2.4) 

(2.5) 

(2.6) 

(2.7) 

(2.8) 

(2.9) 

(2.10) 

(2.11) 

(2.12) 


Proof  Conditions  (2.10)-(2.12)  follow  from  writing  the  equation  e  =  — c  in  detail  and  substituting 
the  K-inner  product  in  the  terms  involving  yi  and  ya.  The  inner  product  in  the  strain-  energy  space 


V  is 


( • ,  •)v  =  (aJ^*  -  •)«. 

(2.13) 

The  condition  (2.9)  comes  from  the  domain  requirements  of  the  operators  Ao, 
adjoints. 

In  applications,  (/a,tb(t))/f  in  (2.4)  can  be  represented  as 

aJ^*,  Do  and  their 

(/2.ti(0)w  =  f  hrv[t)dx. 

Jo 

(2.14) 

Measurements  in  (2.5)  are  spatial  integrals  of  strain; 

yi  =  /’*iV'(0di 

Jo 

(2.15) 

ya  =  /  k2W*'{t)dx. 

Jo 

(2.16) 

These  last  two  measurements  can  be  made  by  fiber  optic  sensors.  Theorem  1  says  that  an 
exponentially  convergent  first-order  observer  can  be  used  to  construct  a  bounded  linear  functional 
of  the  distributed  state  vector  (w,  w)  from  fiber  optic  sensor  data  if  and  only  if  the  velocity  weighting 
/a  is  in  the  strain-energy  space  V.  According  to  condition  (2.9),  /a  must  satisfy  smoothness  and 
boundary  conditions  for  functions  in  V.  These  conditions  will  require  a  certain  number  of  La 
derivatives  and  that  the  generalized  displacement  function  w{t)  satisfy  certain  geometric  boundary 
conditions. 
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In  applications,  (2.10)-(2.12)  must  be  computed  numerically  with  finite  element  or  modal  ap¬ 
proximation  schemes.  Such  numerical  methods  project  the  infinite  dimensional  problem  from  the 
energy  space  E  onto  a  sequence  of  finite  dimensional  subspaces  E„  =  V„  x  V„,  where  V„  €V.  The 
operators  Ao  and  Do  are  approximated  by  Aon  and  Don  in  Vn .  The  weighting  functions  f\  and 
are  approximated  by  /in  and  f^n  in  Vn-  The  functional  sensor  gains  ibi  and  are  approximated  by 
k\n  and  k^n  in  V"n  which  satisfy 


=  /l„  +  -  >4on  ^Sn)/2n  (2.17) 

I^lfi  +  k^n  =  /in  —  /rn-  (218) 

According  to  [6],  /in  and  will  converge  in  V— norm  to  the  functional  control  gains  /i  and  /2, 
respectively,  in  the  infinite  dimensional  control  problem.  Also  Irin  and  Ir2n  will  converge  in  V  to  the 
functional  sensor  gains  Iri  and  k^,  respectively. 


2.3  Finite  Dimensional  Model 


Consider  the  lateral  vibration  of  a  simply-supported  uniform  Euler-Bernoulli  beam.  The  finite 
dimensional  modal  approximation  scheme  is  used.  The  space  can  be  spanned  by  {^i, 
where  is  the  tth  mode  shape,  an  Vn  =  Hn-  If  u>(f)  is  the  solution  of  (2.1)  and  satisfies  the 
boundary  conditions.  w(t)  can  be  approximated  as 


tssl 


(2.19) 


where  is  the  modal  coordinate  of  the  ith  mode. 
Defined  the  state  to  be 


where  q(f)  is 


q(<)  = 


'  »?i(0  ' 


L  »?n(0  J 


The  state  equation  is 


X  =  Ax  +  Bu 


(2.20) 


(2.21) 


(2.22) 


where 


The  stiffness  matrix  Aon  is: 


(2.23) 

(2.24) 


(2.25) 


where  Ui  is  the  natural  undamped  frequency  of  the  ith  mode.  There  are  two  kinds  of  damping 
matrix  Don-  col  and  coAg^^.  The  choice  of  Don  will  influence  the  convergence  of  sensor  gains. 
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The  matrix  bon  is  an  n  x  1  column  vector  whose  elements  are  the  projections  of  the  actuator 
function  Bo(x)  onto  mode  shapes. 

Suppose  two  fiber  optic  sensors  extend  over  the  entire  length  of  a  beam.  Since  kin  &nd  ibjn  are 
in  Vn ,  kin  uid  k2n  can  be  represented  as  linear  combinations  of  eigenfunctions: 


*!"(*)  =  (2.26) 
1 

(2-27) 

1 

The  two  measurements  yi  and  yj  are 

yi(0  =  =  f‘k'i'nW"(t)dx  (2.28) 


PjCO  =  (*3n.«»(f))v.  =  /„' fci'„iy"(<)di. 

Define 


^In  ~ 

k2n  ~ 

(T2 

Lf.J 

(2.28)  and  (2.29)  are  equal  to: 

yi(0  =  kin’’>lonq 
»2(0  =  k2n^i4onq- 

Let  fin  €  Vn  and  /2„  €  V;,  and 

y  =  (/in.MO)*', +  (/2nX0)«. 
=  ^In^-^onq  +  f2n^^ 

where 


II 

fl 

- 1 

_ 1 

9 

II 

■  6i  ■ 

62 

L  7n  J 

.  . 

and  ji  and  6i  are  the  Fourier  coefficients  of  fin  and  f2n  on  ith  mode  shape  respectively. 
The  first  order  functional  observer  for  this  model  is 


(2.29) 

(2.30) 

(2.31) 

(2.32) 

(2.33) 

(2.34) 


Define 

and  c  satisfies 


i(f )  =  -z  +  yi  + 

( =  2(0  +  ya  -  y 


(2.35) 

(2.36) 


c  =  c.  (2.37) 

Substituting  (2.31),  (2.32),  (2.33)  and  (2.36)  into  (2.37),  one  can  have  the  following  matrix  equations 

k2n  ~  ^In  "b  (-^on  “  -^on  •^0n)L2n  (2.38) 

kjn  +  k2n  =  fin  ~  f2n  (2.39) 

0  =  t2iJ'bon.  (2.40) 


So  kin,  ^2n  solved. 
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2.4  Examples 

Consider  the  simply  supported  Euler-Bernoulli  beam.  The  real  Hilbert  space  H  =  I>2(0, 1),  V  = 
Ho(0, 1).  The  stiffness  operator  is 

Ao4  =  4""  (2.41) 

The  actuator  function  is 


sin  5ir(*  —  0.3) 
0 


if  0.3  <  *  <  0.5 
otherwise. 


(2.42) 


Two  cases  of  damping  are  considered 
Case  1:  Do  =  co^o^ 

Case  2:  Do  —  col  with  cq  =  0.02ir^. 


Figures  2.1-2.8  are  generated  using  the  finite  dimensional  model  describes  in  Section  2.3,  assuming 
El  =  I,  m  =  1.  Also  Don  =  in  Case  1  and  Don  =  cq/  in  Case  2. 


2.4.1  Constant  Velocity  Weighting:  A  Counter  Exsunple 

Suppose  we  want  to  build  a  first  order  functional  observer  to  construct  a  spatial  integral  of  transverse 
velocity  over  a  beam  from  two  fiber  optic  sensors  data.  Let  /i  =  0  and  /}  =  1  in  (2.4).  For  a  simply- 
supported  beam,  two  of  the  geometric  boundary  conditions  for  functions  in  V  are  zero  at  both  ends, 
so  fi^V.  From  Theorem  1,  there  exist  no  functional  sensor  gains  k\  and  such  that  the  desired 
y(t)  can  be  constructed. 

In  finite  dimensional  approximation,  the  vector  is  a  zero  vector  and  the  vector  f2n  is  the 
.La-projection  of  /}  =  1  onto  Vn.  Then  solving  (2.39)  and  (2.38),  one  can  get  Icjq  and  k20,  which 
are  the  modal  coeffidents  vectors  of  kin  <tnd  k^n-  A  first  order  observer  can  be  built  by  using  two 
fiber  optic  sensors  designed  according  to  kin  and  k%n  <snd  the  observer  can  construct  the  integral 
over  the  beam  of  the  projection  of  velocity  w{i)  onto  Vn. 

But  as  n  increases,  kin  and  Fan  do  not  converge  in  V.  As  a  result,  k"  and  k'i  diverge  in  La- 
Figures  2.1  and  2.2  illustrate  the  divergence  of  F"  in  both  damping  case. 


2.4.2  An  Optimal  Control  Problem 

For  both  damping  cases,  let  fi  and  /a  be  the  functional  control  gains  which  minimize  the  performance 
index:  ^ 

(IHIv +  !«  +  «*)*•  (2.43) 

Jo 

In  finite  dimensional  approximation,  fin  and  /an  can  be  obtained  by  solving  the  Riccati  Equation. 
[6]  guarantees  that  fin  and  /an  will  converge  to  fi  and  /a  in  V  and  H,  respectively.  Also  Fi 
and  Fa  are  approximated  by  Fi„  and  Fan  and  can  be  obtained  by  solving  (2.39)  and  (2.38).  The 
numerical  results  show  that  F2  is  two  orders  of  magnitude  smaller  than  F".  Therefore  we  conclude 
that  measurement  of  yj  can  be  omitted. 

Figures  2.3-2.8  show  /"„,  /an  and  F"„  with  n  increasing  for  two  damping  cases.  For  Do  =  coA]/^, 
Figure  2.4  shows  that  /an  converges  to  /a  and  /a  €  V.  So  condition  (2.9)  is  satisfied  and  Fi,  Fa 
exist.  Figure  2.5  shows  that  Fi„  converges  to  F" . 

For  the  case  Do  =  cq/.  Figure  2.7  shows  that  /an  still  converges  to  /a  with  /a  €  H,  but  /a  ^  V. 
Because  at  z  =  0.3  and  z  =  0.5,  /a  has  discontinuous  derivative.  This  fact  can  be  verified  by  the 
plots  for  large  n.  So  Fi  and  Fa  do  not  exist  according  to  the  theorem.  Figure  2.8  shows  that  Fj„ 
diverges  as  n  increases.  Therefore  k"  diverges. 
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Figure  2.6:  Do  =  cq/, 

n  =  20,30,40 


O  H-S.30.S0,4a 


n  =  5,20,30,40 


n  =  20,30,40 


n  =  5,20,30,40 


Ml  »kao.l».40 
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3  Op  timization  of  Sensors  Designs  for  Limited  Sensor  Spans 

3.1  Sensors  Extending  Over  Subintervals  of  a  Beam 

fiber  optic  sensors 


*2  *3  I  *4 


Figure  3.9:  Sensors  extend  over  portions  of  the  beam 

Suppose  the  first  fiber  optic  sensor  is  from  zi  to  Xi  and  the  second  one  is  from  Z3  to  Z4,  as  shown 
in  Fig.  3.9.  The  outputs  of  the  sensors  can  be  expressed  as: 


yi=  f  ki{x)w"(x,t)dz 
Jxx 

»2  =  /  k'^{x)w"{x,i)dx 


(3.44) 


(3.45) 


The  finite  dimensional  approximation  of  the  sensor  outputs  can  be  obtained  as  in  the  following, 
is  the  ith  mode  shape  of  a  simply  supported  beam.  Define  and  f  =  1, . . . ,  n,  in  such  a  way 

that 

_  /  ^"(*)  if  *1  <  *  <  *2 


l>fU)  =  /  if  *1  <  *  < 

• '  '  0  otherwise 

=  f  if  *3  <  *  < 

• '  '  (0  otherwise. 


The  geometric  boundary  conditions  are: 

^i(o)  =  ^.(0  =  0 
^.(0)  =  Mf)  =  0. 

There  are  unique  solutions  of  and  for  each  1.  ki  and  F2  can  be  approximated  as 


(3.46) 


(3.47) 


(3.48) 


(3.49) 


*2  =  53  (3.50) 

f=i 

Let  be  the  modal  coordinate  of  the  tth  mode.  Then  the  lateral  displacement  is  approximated 


(3.51) 
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Substituting  (3.49),  (3.50)  and  (3.51)  into  (3.44)  and  (3.45),  the  finite  dimensional  approximation 
of  the  sensor  outputs  are 

yi  =  ki^lV,q(/)  (3.52) 

yj  =  k2’’W'2q(0  (3.53) 

where  q(<)  is  defined  in  (2.21)  and  ki^,k2  are 


1 

■  *11  ■ 

*21 

w 

II 

*12 

k2  = 

*22 

*ln 

*2n 

and  W\  and  are  n  x  n  matrices  with  elements 


(3.54) 


[Wih  = 

/'’^''(x)^"(x)dx 

Jsi 

(3.55) 

mij  = 

/  ^"(x)d"(x)dx. 

(3.56) 

Suppose  we  want  to  construct  a  functional  observer  whose  output  will  converge  to  some  control 
law  asymptotically.  The  state  equation  is  defined  in  (2.20)  and  (2.22).  The  output  of  the  system  is 


yi  =  Cx  =  (CiOixnJx 

(3.57) 

where  Ci  =  kj^^Wi.  The  functional  observer  is  defined  as 

r  i  =  Fz  +  Gy\  + 

\  V>  =^  +  7W2- 

(3.58) 

Let  y  =  •^on<l  +  f2n  ^  control  law  that  the  functional  observer 

is  designed  to  construct. 

Assume  that  G  =  7  =  1,  F  = 

—  1,  and  define  the  error  between  the  output  of  the  functional  observer 

and  the  control  law  to  be 

€  =  <p-y. 

(3.59) 

t  satisfies 

(  +  t  =  0 

(3.60) 

if  and  only  if 

W2V.2  =  Aonfjjj  —  Z?onf2n  +  ^2n 

(3.61) 

H^iki  +  Aonf2n  ~  ~*^2k2  +  Aonfin 

(3.62) 

(3.63) 

For  the  finite  dimensional  model,  the  unique  solutions  of  kj^  and  k2  can  be  obtained  by  solving 
the  above  equations  if  IFi  and  are  nonsingular.  So  the  finite  dimensional  functional  sensor  gains 
ki  and  can  be  obtsuned.  As  the  number  of  modes  that  is  used  in  modal  approximation  increases, 
ki  and  kj  will  diverge  because  the  sensors  extend  only  over  portions  of  the  beam.  As  a  result,  the 
error  will  always  exist. 

On  the  other  hand,  if  sensors  are  distributed  over  the  entire  length  of  a  beam,  W\  and  are 
both  equal  to  the  stiffness  matrix  Aon>  and  the  finite  dimensional  solutions  k^  and  k2  definitely 
exist.  The  functional  sensor  gains  will  converge  as  long  as  conditions  in  Theorem  1  are  satisfied 
and  error  will  go  to  zero  asymptotically. 

Suppose  we  allow  the  error  to  exist,  the  design  problem  now  is  to  find  the  optimal  functional 
sensor  gains  ki  and  ^2,  which  will  converge  and  minimize  the  error  in  some  sense. 
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3.2  Optimization  of  the  Sensor  Gains 

The  open-loop  design  procedure  for  the  optimal  functional  sensor  gains  ki  and  k2  is  presented,  so 
that  the  Lt  norm  of  the  error  c  between  output  of  the  functional  observer  and  the  control  law  will 
be  minimized.  Suppose  the  length  of  the  beam  is  /.  Define  performance  index  to  be: 

J  =  r  +  r  l'[(k‘n\x)  -b  ik'^rix)]dx  (3.64) 

Jo  Jo 

There  are  two  reasons  that  we  add  the  second  integral  to  the  performance  index.  The  first 
reason  is  to  make  k"  and  k,  smoother  so  that  the  manufacturing  of  fiber  optic  sensors  is  feasible. 
Considering  (3.44)  and  (3.45),  the  outputs  of  the  fiber  optic  sensors  are  weighted  spatial  integrals 
of  strain  over  subintervals  of  a  beam.  Fiber  optic  sensors  are  tapered  according  to  the  value  of  k" 
and  k}  along  their  distribution.  Therefore  smoother  k"  and  kj  can  make  the  sensors  easier  to  be 
manufactured.  Also,  k"  and  k]  can  be  interpreted  as  the  sensitivity  to  local  strain.  !f  k"  and  k^'  are 
not  smooth,  then  there  may  be  a  abrupt  change  in  sensitivity  between  two  infinitely  close  points  on 
the  beam.  That  will  make  it  impossible  to  build  the  sensors. 

The  second  reason  is  that  penalizing  (k"')^  +  (k^")^  in  the  performance  index  can  make  k"  and 
k^  converge.  F^om  the  discussion  in  Section  3.1,  when  the  sensors  extend  over  only  subintervals  of 
the  beam,  error  may  always  exist.  So  if  we  only  penalize  €{t)^dt,  k"  and  kjf  will  diverge.  On  the 
other  hand,  just  a  small  value  of  r  can  make  them  converge.  Large  r  can  make  k"  and  k^'  smoother 
and  converge  faster  but  will  result  in  a  large  value  of  open-loop  error  in  loop  gain.  So  it  is  a  design 
trade  off. 

k]^  and  V.2  are  two  finite  dimensional  column  vectors  whose  elements  are  the  coefficients  of  ki 
and  k]  on  each  mode,  respectively.  Define 

(3.65) 

The  performance  index  can  be  represented  as  a  linear-quadratic  functbnal  of  k. 


3.2.1  Augmented  Plant  and  Norm  of  Error 


The  state  equation  of  the  beam  is  defined  in  (2.22)  and  the  first  order  functional  observer  is  defined 
in  (3.58).  The  augmented  state  is  defined  to  be: 


*=[*]■ 

(3.66) 

Then  the  augmented  state  equation  becomes; 

x  =  Ax  +  Bu 

(3.67) 

where 

■  a  A  1 

[  GC  F  ] 

(3.68) 

(3.69) 

Let  y  =  li^q  +  12^4  optimal  control  law  where  Ij^  and  I2  are  n  x  1  vectors.  The 

measurements  from  the  fiber  optic  sensors  are  yi  =  ki^lViq  and  yj  =  k2^W'jq,  where  Wi  and  W2 
are  defined  in  Section  3.1.  The  output  of  the  observer  is  ^  =  z  +  yy2.  Then  the  error  can  be  written 

as: 


€  =  <p-y 
=  Cx 


(3.70) 
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where 


3.2.2  FVequency  Domain 

Let  i(ju)  be  the  Fourier  'Dansform  of  the  function  ((t)  €  LjfO,  oo).  FVom  the  Parseval’s  relation  for 
aperiodical  signal, 

/o"  ^ /!L  <  oo  (3.79) 

therefore  i(ju>)  is  squue  Lebesgue  integrable  and  ((jv)  is  in  the  frequency  domain  space  L2(— oo,  oo). 
The  Z.2  norm  of  i(ju>)  is 

Il«0<*')ll2  =  €-(iw)^du; (3.80) 

The  meaning  of  ||<(iw)||2  is  the  energy  of  the  signal  ((jw).  Therefore  minimize  the  energy  of  signal 
i(ju)  in  frequency  domain  when  the  system  is  subject  to  sinusoidal  input  is  equivalent  to  minimize 
11^(0111  domain. 

Under  the  previous  assumptions  of  impulse  input  suid  zero  initial  augmented  state,  i(ju)  is 

t(jw)  =  C(jwJ  -  A)-^B.  (3.81) 


3.2.3  Solving  the  Lyapunov  Equation 

In  order  to  minimize  ||r(0ll2  (3-78),  we  have  to  solve  (3.76)  first.  Partition  the  solution  matrix  Q 

in  the  following  way; 

«=[«;  «:]  (3  82) 

where  Qi,  Q^,  Qs  and  are  2n  x  2n,  2n  x  1,  1  x  2n  and  1x1  matrices,  respectively.  Then  (3.78) 
can  be  rewritten  as 

l|c(0ll^  =  tr  BB^Qx  +  2fl’’Q2/3  -  ^ .  (3.83) 

(3.76)  becomes: 


C^G 

Qi 

Q2  1 

A  O' 

0^ 

F 

.  <33 

<3^ 

Q3 

<3j 

GCF 

(3.84) 


After  straightforward  matrix  multiplication,  we  can  have  the  following  equations; 

Qi  +  QiA  = 

-  [  ^  -  *1^)  ]  -  GiC-^Ql  +  QjC) 

A'*'Q7  +  C’'G<?4  +  Q^F  =  -  j 

Q2  =  QI 


2F 


where  7,  G  and  F  are  all  scalars.  Define 
H  = 


0 

Wi 


0  J  ^  [0  0  J 


1  = 


*2  J 


(3.85) 

(3.86) 

(3.87) 

(3.88) 

(3.89) 


Qj  can  be  simplified  to  be  a  linezu-  function  of  k; 

Q2  =  (A^  +  F/)-‘  ^  ) 


=  (A^  +  F/)-»((-7i/^  +  ^D’’)k  +  l] 
=  Mik  +  M3 


where  Mi  =  {A^  4-  FI)~^{—jH^  4-  -^D^)  and  M3  =  {A^  4-  F/)~M.  (3.85)  can  be  arranged  to  be 
a  Lyapunov  equation 

where  R  is  defined  as: 


(3.90) 
to  be 

(3.91) 


R  =  -fVj H 

+G{D'^V}J Ml  4-  M,kk^D4-  D^kMf  4-  Mjk^D). 


Since  A  is  asymptotically  stable,  the  solution  of  (3.91)  is 


Qi= 

Jo 


Re^'dt. 


(3.92) 


(3.93) 
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3.2.4  Linear- Quadratic  Functional  of  k 
Consider  (3.83)  first.  The  term  tr  BB^Qi  is  equal  to 


tr  BB^Qi 


=  r  e^*BB’^e^^*dt 

Jo 

-  2yl'^  n  e**BB'^e*'^*dt  H'^k 
Jo 

+  f  e^*BB'^e^^^dtl 
Jo 

+  Gk^Mj  I  e**BB'^t*"'Ui  D^\ 
Jo 

+  2GMl  f  t**BB'^e*^Ut  D^k 

Jo 


+  Gk^ 


e‘**BB'^e^  ‘dt  Mik. 


Define 


e^^BB^e*  *dt. 


Then  V  is  the  solution  of  the  Lyapunov  equation: 


AV-^-VA^  =  -BB'^. 

The  second  term  in  (3.83)  is  equal  to 

2B'^Q20  =  20B'^iMiK  +  M2). 

Therefore  square  of  L2  norm  of  error  is  a  linear-quadratic  form  of  k: 


r  c{t)^dt  =  k'^i.-r^HVH'^ -\.GM'^VD-¥GDVMi)k 

Jq 


-  2{jV  VH^  -  GM^  VD^  -  A/i)k 
+  i'^VI  +  20B'^M2-~ 


where  F,  G  and  j  are  scalars. 

Consider  the  term  /(J[(Jti')^(*)  +  in  performance  index. 

f[{kr)^{r,)  +  {kr)^{r,)]dT]=  r\kr)Hx)dx+  {k'^f{x)dz 

Jo  J*l  JS9 

Define  Ui  and  U2  such  that 

[Uih  =  r\ri^)<p';\x)dx 

Jsi 

[U2]ii=  r^rWJWr,. 

Jfl9 

Define  2n  x  2n  matrix  U  as: 

‘'=[0 1] 


Define  2n  x  2n  matrix  U  as: 


(3.94) 


(3.95) 


(3.96) 


(3.97) 


(3.98) 


(3.99) 


(3.100) 


(3.101) 


(3.102) 
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So  one  have  the  following: 


A(*i")’(*)  +  =  iJUk  (3  103) 

Jo 

Finally,  the  performance  index  is  a  linear-quadratic  form  of  k: 

J  =  +  (*?')’(*)]'/* 

=  k’‘flk-2f>k-|-r  (3.104) 

where 

R  =  y^HVH^  +  GMTVD  +  GDVMi  +  rU  (3.105) 

P  =  y\'^VH'^ -GM^VD"^ (3.106) 

:  „  r  =  I^Vl-t-2/?B^A/2-^.  (3.107) 

According  to  standard  results  on  linear-quadratic  optimisation,  if  R  is  nonnegative,  kg  minimizes 
J  if  and  only  if 

i2ko-P^  =  0.  (3.108) 

Since  is  in  range  space  of  R,  there  exist  at  least  one  kg  that  minimize  J.  If  A  is  positive  definite, 
kg  is  unique.  From  the  numerical  result,  R  is  ill-conditioned,  so  kg  is  not  unique.  The  singular 
v^ue  decomposition  method  is  used  and  kg  is  chosen  to  be  the  minimum  norm  solution. 


3.3  Examples 

The  control  law  y  =  l^x  is  chosen  to  minimize 


where  qo/ro  is  3000  and 


J  =  y* (  9o(Qx, x)  +  ro(u,  u)  )dt 
g  f  -^On  0  1 

^  I  0  J„x„  J 


(3.109) 

(3.110) 


The  plant  model  and  the  functional  observer  are  defined  in  Sections  2.3  and  3.1  respectively.  The 
length  of  the  Euler- Bernoulli  beam  is  1. 

Two  sensors  are  distributed  from  0.2  to  0.6.  The  actuator  Bo  is: 


sin  5x(z  —  0.3) 
0 


if  0.3  <  X  <  0.5 
otherwise. 


(3.111) 


Figures  3.11-3.14  demonstrate  the  influence  of  weighting  r  on  k"  and  ig.  Both  k"  and  kjf  are 
convergent  due  to  penalizing  r  /Q[(tJ")*(*?)  +  (*j”)*(>l)]‘^0  performance  index.  Larger  r  can  make 
them  smoother  but  result  in  large  open-loop  error,  as  illustrated  in  Tables  3.2-3.3. 

Let  n  be  the  number  of  modes  in  the  finite  dimensional  model  from  which  the  optimal  sensor 
gains  k]^ ,  k2  are  obtained  and  JV  be  the  number  of  inodes  of  finite  dimensional  approximation  for 
the  plant  in  application.  Figures  3.15-3.17  are  the  open-loop  magnitude  response  of  the  observer 
output  and  the  desired  control  law,  n  =  AT  for  each  case.  The  difference  between  the  two  curves 
in  each  of  Figures  3.15-3.17  is  the  square  of  La  norm  of  error  ||c(yw)||3  in  frequency  domain.  The 
value  of  ||c(iw)|||  is  equal  to  ||<(t)||3  and  is  shown  in  Tables  3.2-3.3  for  different  r  and  n  (n  =  N). 
The  magnitude  response  is  an  open-loop  response  because  the  observer  is  only  cascaded  to  the  plant 


14 


z 


0.2 


actuator  Bo(x) 

0.6 


A 


0  !  0.3  0.5 

two  fiber  optic  sensors 


Figure  3.10;  The  actuator  Bo{*)  &nd  the  distribution  of  collocated  sensors 


and  the  observer  output  does  not  feed  back  to  the  plant.  Therefore  it  is  the  open-loop  error  that  we 
minimize. 

Figures  3.18-3.20  are  the  closed-loop  magnitude  response,  with  n  =  N.  The  input  is  e“^*  acting 
at  the  actuator  function  Bo-  The  output  is  the  transverse  displacement  at  z  =  0.4.  In  each  plot 
the  solid  line  denotes  the  response  of  the  system  whose  compensator  using  a  functional  observer  in 
the  feedback  loop.  The  dotted  line  represents  the  ideal  full  state  feedback  control.  Comparing  two 
curves,  we  know  whether  the  observer  is  efficient  or  not. 

Figures  3.23  and  3.24  are  the  closed-loop  magnitude  responses  with  n  =:  5,1^  =  15  and  n  = 
10,  N  s:  15,  respectively.  In  Figure  3.23,  the  first  mode  is  lightly  damped  as  compared  to  other 
plots.  It  is  because  the  real  part  of  closed-loop  eigenvalue  of  observer  n  =  5  is  only  19  4%  of  that  of 
observer  n  =  15,  as  shown  in  Table  3.1. 

At  high  frequencies,  the  observer  achieve  high  performance.  This  can  be  verified  by  using  different 
r’s,  n’s  and  N's. 


first  mode  eigenvalue 

real  part 

imaginary  part 

n  =  5,N  =  15 

-1.220 

8.120 

n  =  10,  IV  =  15 

-3.764 

6.369 

n  =  15,7V  =  15 

-6.285 

6.671 

full  state  feedback 

-6.227 

9.719 

Table  3.1:  First  mode  eigenvalues 
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responses  of  observer  output  *p  and 
control  law  y,  n  =  5,  JV  =  5,  r  =  10“'^ 


Figure  3.18:  Closed-loop  magnitude 
response,  n  =  5,  IV  =  5,  r  =  10“** 


Figure  3.16:  Open-loop  magnitude 
responses  of  observer  output  <p  and 
control  law  y,  n  =  10,  iV  =  10,  r  =  10“** 


Figure  3.19:  Closed-loop  magnitude 
response,  n  =  10,  TV  =  10,  r  =  10“** 


responses  of  observer  output  ip  and 
control  law  y,  n  =  15,  TV  =  15,  r  =  10“** 


Figure  3.20:  Closed-loop  magnitude 
response,  n  =  15,TV  =  15,  r  =  10“** 
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Figure  3.21:  Open-loop  magnitude  Figure  3.23:  Cloeed-Ioop  magnitude 

responses  of  observer  output  tp  and  response,  n  =  5,  AT  =  15,  r  =  10~ 

control  law  y,  n  =  5,  N  =  15,  r  =  10“^* 


Figure  3.22:  Open-loop  magnitude  Figure  3.24:  Closed-loop  magnitude 

responses  of  observer  output  tp  and  response,  n  =  10,  iV  =  15,  r  s=  10" 

control  law  y,  n  =  10,  TV  =  15,  r  =  10“^* 


4  Conclusions 


A  first-order  functional  observer  for  an  optimal  control  law  for  the  distributed  model  of  a  one- 
dimensional  flexible  structure,  such  as  a  beam  or  rod,  can  be  constructed  if  the  outputs  of  two 
properly  designed  fiber  optic  sensors  are  available.  The  sensor  designs  are  determined  by  the  func¬ 
tional  gains  in  the  control  law  determined  from  control  theory  for  distributed  systems.  Usually, 
these  sensors  must  extend  over  the  entire  length  of  the  structure.  Theorem  1  in  Section  2  provides 
conditions  that  the  sensor  gains  must  satisfy.  The  most  important  one  is  the  existence  condition. 
If  the  existence  condition  is  satisfied,  it  follows  from  [6]  that  the  finite  dimensional  sensor  gains  will 
converge  as  the  number  of  modes  in  the  approximation  scheme  increases. 

When  the  sensors  extend  over  subintervals  of  a  structure,  the  sensors  are  designed  to  minimize 
the  error  between  the  desired  control  law  that  the  control  law  the  firs-order  observer  constructs 
from  the  sensor  outputs.  This  design  criterion  leads  to  a  quadratic  optimization  problem  for  any 
finite  number  of  modes  in  the  model  of  the  structure,  and  a  small  penalty  (i.e.,  regularization)  on 
appropriate  derivatives  of  the  functions  defining  how  the  sensors  are  tapered  guarantees  that  the 
sensor  designs  converge  as  the  number  of  modes  in  the  model  increase.  Section  3  formulates  the 
optimization  problem  and  demonstrates  the  design  procedures  in  detail.  The  numerical  examples 
demonstrate  that  the  resulting  closed-loop  control  system  achieves  high  performance. 
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Abetract 

This  paper  addresses  the  problem  of  fitting  digital  input/output  models  to  data  generated 
by  linear  systems  in  the  presence  of  white  process  and  sensor  noise.  The  systems  of  interest 
have  state-space  realizations  in  Hilbert  spaces.  Both  finite-dimensional  and  infinite-dimensional 
input/output  models  are  considered.  The  paper  derives  a  number  of  new  results  for  least- 
squares  estimation  and  filtering,  and  introduces  a  new  class  of  minimax  parameter-estimation 
and  filtering  problems.  The  main  results  characterize  the  asymptotic  values  to  which  parameter 
estimates  converge  with  increasing  amounts  of  data. 
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1  Introduction 


The  linear  systems,  or  plants,  considered  in  this  paper  have  time-invariant  discrete-time  realizations 
in  possibly  infinite-dimensional  Hilbert  spaces.  The  plants  are  driven  by  unknown  white  process-noise 
sequences  and  possibly  by  known  forcing  sequences.  Also,  the  measured  outputs  are  contzuninated  by 
white  sensor-noise.  The  objective  of  the  paper  is  to  characterize  the  asymptotic  values  of  estimated 
parameters  in  digital  input/output  models  when  such  models  are  fit  to  the  data  according  to  least- 
squares  (LS)  estimation  criteria  and  according  to  new  minimax  estimation  criteria  introduced  in  the 
paper. 

We  consider  one-step-ahead  output  prediction,  primarily  with  ARX  modeb,  which  are  models 
containing  linear  regressions  of  measured  outputs  and  known  inputs.  Also,  two  results  are  given 
on  least-squares  estimation  of  FIR  and  IIR  modeb,  which  use  only  hbtories  of  known  inputs  to 
predict  a  measured  output.  The  main  results  of  the  paper  for  LS  estimation  show  that,  under 
conunon  conditions,  the  asymptotic  least-squares  parameter  estimates  for  ARX  models  are  Markov 
parameters  of  Kalman  filters  that  would  be  constructed  for  the  plant  were  the  state-space  model  and 
the  noise  statistics  known.  The  hypotheses  about  the  noise  processes  are  consbtent  with  those  in 
Kalman  filtering  theory  for  finite-dimensional  systems  and  the  analogous  infinite-dimensional  theory 
[1,  2].  However,  constructing  a  Kalman  filter  requires  the  plant  and  nobe  statbtics  to  be  known, 
whereas,  in  the  parameter  estimation  problems  considered  here,  the  plant  and  noise  are  unknown; 
only  input/output  data  is  available. 

Since  we  are  concerned  with  the  limits  to  which  parameter  estimates  converge  as  increasing 
amounts  of  data  are  used,  we  require  that  the  plant  be  exponentially  stable  so  that  the  response 
approaches  a  steady  state,  or  stationary  response.  Abo,  the  known  forcing  sequence  b  required  to 
have  asymptotically  time-invariant  statistics  so  that  the  system  response  will  have  asymptotically 
time-invariant  statbtics.  Most  of  the  results  require  the  known  forcing  sequence  to  be  white.  Such 
conditions  are  realistic  because  it  b  common  in  experimental  identification  to  excite  a  system  with 
a  known  white  sequence  or  a  band-limited  sequence  obtained  by  filtering  a  white  sequence;  in  the 
latter  case,  the  band-pass  filter  used  in  obtaining  the  band-limited  forcing  sequence  becomes  part  of 
the  plant  to  be  identified. 

An  important  result  that  does  not  require  the  known  forcing  sequence  to  be  white  concerns  the 
problem  of  using  LS  estimation  to  fit  an  infinitely-long  ARX  model  to  input/output  data  from  a 
plant  of  the  class  considered  here.  In  thb  case,  the  asymptotic  parameter  estimates  are  the  Markov 
parameters  of  a  steady-state  (infinite-time)  Kalman  filter.  This  result  is  known  for  finite-dimensional 
plants,  and  is  the  basb  for  the  Observer/Kalman  Identification  (OKID)  method  developed  in  [3,  4,5]. 
This  paper  obtains  the  corresponding  result  for  infinite  dimensions,  but  more  important  are  the 
results  here  for  finite-dimensional  ARX  models,  since  only  finite-dimensional  modeb  can  be  used  in 
practice. 

It  b  shown  that,  under  common  conditions,  using  LS  estimation  to  fit  an  ARX  model  of  order 
N  to  input/output  data  from  either  a  finite-dimensional  or  an  infinite-dimensional  plant  produces 
asymptotic  parameter  estimates  that  are  equal  to  Markov  parameters  of  the  discrete-time  Kalman 
filter  that  solves  a  certain  minimum-variance  state-space  filtering  problem  on  an  interval  of  length 
N.  The  dimension  of  thb  state-space  filter  equab  the  dimension  (possibly  infinite)  of  the  plant  gen¬ 
erating  the  data  (even  though  thb  dimension  is  not  known  to  the  parameter-estimation  algorithm), 
but  the  length  of  the  time  interval  for  which  the  state-space  filter  b  optimal  equals  the  order  of  the 
ARX  model  identified.  Hence,  as  the  order  of  the  ARX  model  becomes  large,  the  corresponding 
Kalman  filter  approaches  the  steady-state  Kalman  filter.  (By  Markov  parameters  for  a  filter  with 
time-dependent  gains,  we  mean  the  coefficients  of  past  input  and  output  data  in  a  prediction  for¬ 
mula,  an  obvious  generalization  from  the  time-invariant  case.  See  Section  5.1.)  Another  important 
point  is  that,  while  the  asymptotic  parameter  estimates  for  the  infinite-dimensional  ARX  model  are 
independent  of  the  signal-to-nobe  ratio  of  the  input/output  data,  this  b  not  true  of  the  parameter 
estimates  for  a  finite-dimensional  ARX  model.  The  results  here  show  how  the  signal-to-nobe  ratio 
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afTncts  the  asymptotic  parameter  estimates  for  a  ftnite-dimensiooal  ARX  model. 


A  new  class  of  parameter  estimation  problems  is  introduced  in  Section  4,  where  the  estimated 
parameters  are  minimizing  solutions  to  minimax  problems  for  quadratic  fit-to-data  criteria.  The 
parameters  that  solve  the  order-recursive  minimax  problem  in  Section  4  are  Markov  parameters 
of  a  discrete-time  Hoo  filter  for  a  finite  time  interval.  Our  purpose  in  introducing  the  minimax 
parameter  estimation  problems  is  to  establish  the  basis  for  adaptive  filters  that  share  the  disturbance- 
attenuation  and  robustness  properties  of  Noa  filters  designed  for  known  state-space  models  [6,  7,  8, 
9,  10,  11,  12,  13,  14,  15,  16,  17,  18]. 

In  Section  4,  we  consider  only  finite-dimensional  ARX  models  because  numerical  parameter- 
estimation  algorithms  are  possible  for  the  finite-dimensional  models  only.  Under  the  hypotheses  of 
Section  4,  known  results  on  linear-quadratic  games  imply  that  the  Markov  parameters  of  an  Hoo 
filter  for  a  finite  time  interval  approach  the  Markov  parameters  of  a  steady-state  Hoo  filter  as  the 
length  of  the  interval  becomes  large.  Therefore,  it  follows  from  the  results  in  this  paper  that,  as 
the  order  of  the  ARX  model  becomes  large,  the  minimax  parameter  estimates  approximate  Markov 
parameters  of  a  steady-state  Hoo  filter. 

This  paper  does  not  address  numerical  algorithms  for  solving  the  parameter-estimation  problems 
analyzed  here.  There  are,  of  course,  many  sophisticated  methods  for  both  adaptive  and  off-line  least- 
squares  parameter  estimation  (adaptive  lattice,  transversal,  and  square-root  filters,  adaptive  and 
nonadaptive  QR  algorithms).  For  the  minimax  parameter-estimation  problems  introduced  in  this 
paper,  we  have  begun  to  develop  adaptive  (i.e.,  recursive)  numerical  methods,  but  no  methods  with 
the  efficiency  and  stability  of  modern  numerical  methods  for  LS  problems  exist  yet  for  the  minimax 
problems.  Fast,  numerically  stable  algorithms  should  come  soon.  Meanwhile,  the  purpose  of  this 
paper  with  regard  to  minimax  parameter  estimation  is  to  define  the  new  problems  and  characterize 
the  asymptotic  properties  of  the  solutions. 

The  analytical  methods  of  this  paper  are  influenced  by  the  following  practical  questions,  which 
underlie  the  entire  paper.  When  we  run  a  parameter-estimation  algorithm  with  a  particular  set  of 
experimental  data,  what  happens?  Do  the  parameter  estimates  converge?  If  so,  to  what?  Although 
our  assumptions  about  the  whiteness  and  independence  of  process  and  sensor-noise  sequences  are 
essential  to  the  methods  and  results,  we  view  the  central  questions  and  answers  of  the  paper  as 
fundamentally  deterministic.  Thus  the  analytical  methods  of  the  paper  are  purely  deterministic.  It 
can  be  argued  that  the  white-noise  sequences  that  we  assume  2U’e  sample  sequences  from  ergodic 
stochastic  processes,  but  the  central  questions  of  this  paper — about  what  happens  with  a  particular 
experimental  data  sequence — can  be  answered  in  a  deterministic  framework.  We  believe  that,  by 
stripping  away  the  probabilistic  machinery  usually  employed  when  studying  problems  with  random 
noise  (see  [19]  for  example),  we  cut  more  directly  to  the  questions  addressed  here  about  what  happens 
with  actual  data. 

We  set  up  the  basic  analytical  framework  in  Section  2  by  defining  correlation,  independence, 
and  whiteness  for  scalar  and  Hilbert-space-valued  sequences.  These  definitions,  which  involve  time 
averages,  are  standard  for  scalar  sequences,  and  the  generalizations  to  Hilbert-space- valued  sequences 
are  streughtforward.  The  novel  component  of  this  framework  is  a  Hilbert  space  containing  equivalence 
classes  of  deterministic  scalar  sequences.  The  parameter-estimation  problems  are  formulated  and 
analyzed  in  this  space.  Because  we  want  to  use  the  correlation  of  deterministic  scalar  sequences 
as  an  inner  product,  the  Hilbert  space  must  contain  equivalence  classes  instead  of  scalar  sequences 
themselves.  Two  scalar  sequences  are  defined  to  be  equivalent  if  their  difference  is  asymptotically 
zero  in  the  mean-square  sense.  The  correlation  of  two  scalar  sequences  is  the  inner  product  of  their 
respective  equivalence  classes.  In  particular,  uncorrelated  sequences  (i.e.,  sequences  with  correlation 
0)  belong  to  orthogonal  equivalence  classes. 

We  define  and  analyze  parameter-estimation  problems  for  infinite  data  sequences  in  Sections  3 
(least-squares  problems)  and  Section  4  (minimax  problems).  While  the  most  important  interpre- 
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tation  of  the  peer’s  main  results  is  the  characterization  of  the  solutions  to  certain  parameter 
estimation  problems  for  ARX  models  of  an  unknown  plant  in  terms  of  Kalman  and  Hoo  Alters  for 
state-space  models  of  a  known  plant,  we  do  not  employ  state-space  Altering  theory,  stochastic  or 
otherwise,  in  our  basic  analysis.  Rather,  the  framework  of  white  deterministic  sequences  estab¬ 
lished  in  Section  2  makes  it  easy  to  show  that  the  parameter  estimation  problems  are  equivalent 
to  certain  linear-quadratic  (LQ)  control  problems.  The  LS  estimation  problems  are  equivalent  to 
linear-quadratic  regulator  (LQR)  problems,  and  the  minimax  estimation  problems  are  equivalent  to 
linear-quadratic  games. 

Section  5  shows  the  precise  relationships  of  the  estimated  ARX  parameters  to  state-space  Alters. 
This  is  the  only  section  of  the  paper  that  uses  any  state-space  Altering  theory.  It  is  most  convenient 
to  characterize  the  solutions  to  the  parameter  estimation  problems  Arst  in  terms  of  the  detailed 
solutions  to  the  corresponding  optimal  control  problems;  Riccati  operators,  closed-loop  systems, 
etc.  (These  details  are  not  discussed  in  Sections  3  and  4.)  The  relationships  of  the  estimated 
parameters  to  Kalman  and  Hoo  Alters  follow  then  from  the  dual  mathematical  structures  of  optimal 
linear-quadratic  controllers  and  optimal  state-space  Alters. 

Section  6  discusses  parameter-estimation  problems  with  Anite  data  sequences,  the  only  parameter- 
estimation  problems  that  can  be  solved  numerically.  These  problems  are  just  restatements  of  the 
problems  in  Sections  3  and  4  except  that  the  objective  functionals  in  Section  6  are  deAned  for  Anite 
data  sequences.  The  main  motivation  for  the  framework  set  up  in  Section  2  is  to  2dlow  the  deAnitions 
in  Sections  3  and  4  of  parameter-estimation  problems  to  which  the  problems  in  Section  6  converge 
as  the  lengths  of  the  Anite  data  sequences  increase.  According  to  the  theor^'^s  in  Section  6,  the 
psirameters  that  solve  the  problems  with  Anite  data  sequences  converge  to  the  parameters  that  solve 
the  corresponding  problems  with  inAnite  data  sequences.  This  convergence  follows  easily  from  the 
problem  deAnitions  in  Sections  3  and  4. 
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2  Statistics  of  Deterministic  Sequences 

All  aequencea  in  thia  paper  will  have  the  form 

y  =  [ito  y»  yj  •  ••  ]•  (2  i) 

The  ahift  operates  ia  defined  by 

«~‘y  =  (0  yo  yi  yj  •  •  •  ].  (2-2) 

and  9""  meana  for  any  nonnegative  integer  n. 

For  y  and  v  acalar  aequencea,  we  define 

fy,  4  =  J  53 

s=0 

lly|  =  «y.yr'=‘  =  (iim7^|y<n»/=*  (2.4) 

1^00  I 

»=0 

when  the  limits  exist.  (We  say  that  such  limits  exist  only  if  they  are  finite.)  Two  scalar  sequences  y 
and  V  have  correlation  a  if  the  limit  in  (2.3)  exists  and  equals  the  (finite)  complex  number  o;  y  and 
V  are  independent  if  q~"'y  and  q~’'v  have  correlation  0  for  all  nonnegative  integers  m  and  n. 

Let  w  and  u  be  sequences  in  Hilbert  spaces  W  and  U,  respectively.  For  each  q  €  W, 

=  ((u'o,>?)iv  {^2,v)w  •••)  (2-5) 

is  a  scalar  sequence.  The  sequences  w  and  u  have  bounded  correlation  if,  for  each  q  €  W  and  (  SU, 
the  scalar  sequences  [u;,q]|  and  (u,{]  have  some  correlation  a.  When  this  is  the  case,  the  Principle 
of  Uniform  Boundedness  implies  that  there  exists  a  real  number  r  such  that 

|«u,^]l,[u;,ql»|  <  r^KI  •  |q|  Vq  €  W,  «  G  f/.  (2.6) 

When  w  and  u  have  bounded  correlation,  there  exists  a  unique  bounded  linear  operator  IP*'"  from 
W  to  U  such  that 

«[«.^l,KqI))  =  («““'q.0i/-  (2.7) 

The  operator  IP*'"  is  called  the  correlation  operator  for  u  and  w.  The  Hilbert-space-vaJued  sequence 
w  has  bounded  autocorrelation  if  the  conditions  for  tv  and  u  to  have  bounded  correlation  hold  with 
u  =  w.  In  this  case,  we  write  IP"  for  IP"'".  The  operator  R'"  is  called  the  autocorrelation  operator 
for  w,  and  IP"  is  self-adjoint  and  nonnegative.  (Operators  analogous  to  the  correlation  operators 
here  have  been  defined  for  Hilbert-space-valued  random  variables.  See  [1,  2]  for  example.) 

The  Hilbert-space-valued  sequences  w  and  u  are  independent  if  the  scalar  sequences  ^tv,  q]|  and 
[u,(]  are  independent  for  all  q  G  W  and  ^€U.  Hence  w  and  u  are  independent  if  and  only  if  q~"'w 
and  9~”u  have  bounded  correlation  and  zero  correlation  operator  for  all  nonnegative  integers  m  and 
n. 

A  Hilbert-space-valued  sequence  w  is  white  if  w  has  bounded  autocorrelation  and  w  and  q~''w 
have  bounded  correlation  and  zero  correlation  operator  for  all  positive  integers  n. 

In  the  following  sections,  the  input  sequences  to  linear  systems  will  satisfy  at  least  the  following 
condition  for  a  sequence  u  in  a  Hilbert  space  U : 

q~'u  and  q~^u  have  bounded  correlation  for  all  nonnegative  integers  t  and  j.  (2.8) 
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Theorem  2.1  (Convolution)  Suppose  that  u  satisfies  (2.8),  for  some  r  >  0,  and  a,  6  U 

with 

00 

^|ai|<c».  (2.9) 

isQ 


Define  the  scalar  sequence  y  by 


Vk  =  i  =  0,l,  ....  (2.10) 

i=0 

where  «*_,  =  0  when  i  >  k.  For  this  y,  the  limit  in  (2.4)  ezt5<«  and  the  following  inequahttes  hold: 

(2.11) 

t=0 

n  00 

By- <  »•  53  k!,  n  =  0,l .  (2.12) 

1=0  <=n+l 

If  V  is  a  scalar  sequence  with  bounded  autocorrelation  and  if  v  and  q~’u  have  bounded  correlation 
for  all  nonnegaiive  integers  i,  then  the  limit  in  (2.3)  exists. 

Proof  Define 


,vh  =  jEuiVi,  |y|l.  =  [7Ely.P]‘'^  i  =  1,2,3, 


(2.13) 


isO 


•  sO 


y"  =  n  =  0,l,2,  ....  (2.14) 

•=o 

The  Principle  of  Uniform  Boundedness  and  (2.8)  imply  that  there  exists  a  real  number  p  such  that 


<  pI^I  V(€U,  i>0,t>  1,  (2.15) 

n 

«y"-y'"a.  <p  kl,  Vn>m,  <>1.  (2.16) 

i=m+l 

Also,  i?“  <  implies 

<  rj^l  Vi>0,  (2.17) 

n 

|y"-y">|<r  kl,  Vn>m,  (2.18) 

•=m+l 

Hence,  Jj/”)  converges  to  some  real  p.  For  m.n  >  0, 

lBy"l.  -  |y"ll  <  |y"  -  y'"l.  +  lly"*!.  -  |y"’ll  +  lly'”  -  y"|.  (2.19) 


Since  lim,_oo  ly"*!*  =  |y"*|  for  each  m,  (2.16) — (2.19)  imply  that,  as  t  — ►  oo,  |y"||,  — ►  |y"||  uniformly 
in  n.  That  the  limit  in  (2.4)  exists  follows  then  from 


p  =  limlimly"||,  =  limlim  Jy^B,  =  lim||y|,  =  |ly|.  (2.20) 

n  I  t  n  t 

Since  lim,  ||y"|t  =  ly"[,  (2.14),  (2.17),  and  (2.20)  imply  (2.11).  Then  replacing  y  in  (2.11)  with 
y  -  y"  yields  (2.12). 
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Since  v  and  q~*u  have  bounded  correlation  for  all  nonnegative  i,  v  and  have  bounded  correla¬ 
tion  for  all  m,  and  (2.18)  implies  that  is  a  Cauchy  sequence.  For  n  >  t,  |y,  v|i  = 

Let  (  >  0,  2uid  choose  m  such  that  the  right-hand  side  of  (2.16)  is  less  than  c.  Choose  t  such  that 
|{y”*,  v^t  —  fy*",  v]^|  <  (,  and  let  n  =  t  -f-  m.  Then 

\iy,vh  -  «y"’. <  l((y"  -  +  l«y'",v)i  -  Cy'".t;||  <  (sup|v|.  -h  i)f.  (2.2i) 

t 

Therefore, 

lim((y,v>,  =  limfy'",t>)).  □  (2.22) 

I  m 

When  u  satisfies  (2.8),  we  define  5u  to  be  the  set  of  all  scalar  sequences  y  having  the  form  in 
(2.10)  with  Oi  satisfying  (2.9).  Then  5u  is  a  linear  space,  and  Theorem  2.1  implies  that  (2.3)  and 
(2.4)  define,  respectively,  a  sesquilinear  functional  and  a  seminorm  on  Su- 

Definition  2.2  Two  scalar  sequences  y  and  v  are  equivalent  if  |y  —  v|  =  0.  Two  sequences  w  and 
w  in  a  Hilbert  space  W  are  equivalent  if  [u;,  q]  and  [tn,q]  are  equivalent  'dt)  €W. 

Theorem  2.3  If  w  and  w  are  equivalent  sequences  in  a  Hilbert  space  W  and  u  and  u  are  equivalent 
sequences  in  a  Hilbert  space  U,  then  w  and  u  have  bounded  correlation  and  correlation  operator 
if  and  only  if  w  and  u  have  bounded  correlation  and  correlation  operator 

When  u  satisfies  (2.8),  we  define  5u  to  be  the  set  of  all  equivalence  classes  of  elements  of  5u,  so 
that  5u  is  an  inner-product  space,  or  pre-Hilbert  space,  with  inner  product  and  norm  determined 
by  (2.3)  and  (2.4),  respectively.  If  u  and  w  are  sequences  in  Hilbert  spaces  U  and  W,  respectively, 
if  u  and  w  each  satisfy  (2.8),  and  if  q~*u  and  q~lw  have  bounded  correlation  for  all  nonnegative 
integers  t  and  j,  then  Su®Su,  is  an  inner-product  space  with  inner  product  and  norm  determined 
by  (2.3)  and  (2.4). 

When  u  satisfies  (2.8),  we  define  the  Hilbert  space  5u  to  be  the  completion  of  iSu-  Hence  the 
Hilbert  space  Su  ®  Su,  is  the  completion  of  Su  0  Su,  (under  the  conditions  stated  for  Su  0  Su,  to  be  an 
inner  product  space).  If  u  and  w  are  independent,  Su  and  Su,  are  orthogonal.  Whether  every  element 
of  Su  is  an  equivalence  class  of  scalar  sequences  is  a  open  question,  which  does  not  matter  for  the 
rest  of  this  paper  because  the  only  elements  of  Su  0  Su,  encountered  are  obtained  from  convolutions 
of  the  generating  sequences  u  and  w. 

Henceforth,  the  notation  in  this  paper  usually  will  not  distinguish  between  a  scalar  sequence  s 
and  the  equivalence  class  of  scalar  sequences  equivalent  to  s  when  s  is  a  data  or  noise  sequence.  This 
abuse  of  notation  is  similar  to  the  common  practice  of  referring  to  elements  of  the  Hilbert  space 
1.2(0, 1)  as  square-integrable  functions.  On  occasion,  we  will  refer  to  the  following  condition  for  a 
Hilbert-space-valued  sequence  u; 

Ig~*u,^J  ^  closed  span{l5~^«,^J  ;  j  >  0,  j  i)  Vi  >  0  and  €  I/,  (2.23) 

where  u*  €  1/  (it  =  0, 1,2,  ... )  and  closed  span{[g~^«,fj  '■  j  >  0,  j  ^  i)  is  a  subset  of  Su-  While 
the  technically  correct  interpretation  of  (2.23)  requires  that  and  be  interpreted  as 

equivalence  classes  of  scalar  sequences,  the  practical  meaning  of  (2.23)  is  that,  for  any  two  elements 
(  and  (  of  U,  the  scalar  sequence  [9~'u,^]  cannot  be  approximated  arbitrarily  closely  in  the  sense  of 
the  seminorm  in  (2.4)  by  linear  combinations  of  the  scalu  sequences  j  ^  i.  The  condition 

(2.23)  holds  if,  for  example,  u  is  a  finite  linear  combination  of  periodic  sequences  and  a  nonzero 
white  sequence. 

In  the  rest  of  this  paper,  ((-,  ■))  and  ||  •  |  will  denote  the  inner  product  and  norm  on  Su  ®  Su,, 
while  (•,  •)  and  |  •  |,  without  subscripts,  will  denote  the  inner  products  and  norms,  respectively,  on  all 
other  Hilbert  spaces  (including  C,  the  complex  plane).  There  should  be  no  ambiguity  about  which 
elements  belong  to  which  spaces. 
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3  Least-Squares  Estimation  and  Prediction 

We  let  U ,  W,  and  Y  be  Hilbert  spaces,  and  we  assume  that  there  exist  absolutely  sununable 
sequences  of  bounded  linear  operators  £,J“  €  B(U,Y),  ij’"  €  B(W,Y)  and  absolutely  summable 
sequences  cj“  6  i/,  cj*  €  W,  aj"  €  U,  and  aj*  €  Y.  For  each  *,  we  define  Ai'‘  €  U\  €  Y\ 
XJ“  €  f/',  and  IJ*  €  W  by 

Ai'‘v={v,c^*n,  A[%=(f,.a[*),  =  {v.c’n,  (31) 

where,  in  each  case,  q  is  in  i/,  W,  or  y  as  needed.  (When  U,  for  example,  is  C",  each  aj"  is  an 
n-dimensional  column  vector  and  AJ”  is  the  complex-coigugate  of  the  transpose  of  0^“  ) 

For  u  a  sequence  in  U,  we  write 

I);"U  =  (LJ“U0  ij""!  ^*““2  ^i“«=K“UO  Al'^Ui  ^i“U2  •  •  (3.2) 

and  similar  equations  for  Zj*",  ZJ“,  etc.  We  define  the  convolution  operators  Z*“,  j4'“,  Z**",  Z'“', 
Z*“,  and  A‘*  by 

OQ  00 

Z''“u  =  52z]'“?-*u,  =  etc.  (3.3) 

fc=0  t=0 

Each  of  these  operators  maps  sequences  to  sequences.  Changing  the  order  of  summation  in  A‘*L*'‘u 
and  yields 

A-'v-u = (31) 

tsO  ;k=0  t=0  ifcsO 

00  t  00  • 

A-v'w = xfirr.],-'® = i:!?-'"-.  E  (35) 

1=0  tsO  isO  i=0 

where  I-,-)  means  a  scalar  sequence  defined  as  in  (2.5),  and  Zf”*  and  Z}'"'*  are  the  adjoints  of  Zf" 
and  Zj",  respectively.  Hence, 

L-u  -  .4*“u  -  A'^Z'-U  =  f;|[q-'«.lcr  -  of"  -  ^  (3.6) 

i=0  t=0 

z'"  u;  -  A'vz^-u, = f;ir''i«.[cr  -  E  i^r;oi>']]i.  (3.?) 

<=0  t=0 

The  following  hypothesis  is  implicit  in  all  subsequent  problems  and  theorems. 

Hypothesis  3.1  Then  exist  independent  sequences  u,  w,  and  v  in  the  Hilbert  spaces  U ,  W,  and  C", 
nspectively.  Each  of  these  sequences  has  bounded  autocorrelation,  w  is  white,  and  u  satisfies  (2.8). 
Then  exist  a  Hilbert  space  Y  and  absolutely  summable  sequences  Zj“  €  B{U,Y),  ZjJ“'  €  B(W,Y), 
cl“  €  U,  c;"  €  W,  ZJ“  €  U',  and  ZJ"  €  W  with  cj“  niated  to  ZJ"  and  cj"  nlated  to  ZJ""  as  in 
(3.1).  The  sequences  Lq"w  and  an  independent  of  Ll^'w  and  L\'"w  Vik  >  1. 

In  the  subsequent  estimation  and  prediction  problems,  u  is  a  known  input  sequence  to  a  linear 
system,  w  is  an  unknown  noise  sequence,  and  v  is  another  unknown  sequence,  often  to  be  estimated. 
Process  noise  and  measurement  noise  are  components  of  w.  The  measurement-noise  sequences  are 
Zo*"!!/  and  w.  The  inner-product  space  Su  9  S,u  of  equivalence  classes  of  scalar  sequences  is 
constructed  as  in  Section  2. 
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FYom  here  on,  the  notation  in  this  paper  does  not  distinguish  between  a  scalar  sequence  |[9~'u;,  ij] 
toi  t)  £  W  and  the  equivalence  class  of  all  scalar  sequences  equivalent  to  [9~'u',  q]-  The  notation 
is  abused  similarly  for  u  and  the  output  sequences  y  and  s  in  the  following  problems.  Under 
Hypothesis  3.1,  =  c*“  —  a*“  —  ’!•  “  ~  5Z*=o  absolutely 

sununable  sequences  in  U  and  W,  respectively.  Therefore,  the  right-hand-sides  of  (3.6)  and  (3.7) 
are  dements  of  5u  and  ,  respectively. 

Piv^blem  3.2  (LS:  Infinite  Length  ARX)  Lti  ike  Y-valued  isquence  y  and  ike  scalar-valued  se¬ 
quence  s  satisfy 

y=I»“u-t- (3.8) 

s  =  L‘'‘u  +  L’^w-i-v.  (3.9) 

Find  absolutely  summable  sequences  a**  =  (aj*  a**  aj*  . . . )  in  Y  and  a*“  =  [aj“  a*“  aj"  ...  1  in  U 
to  minimize 

j£(a*»,a‘“)=  |A*»y-l-A*“u-s|’  (3.10) 

subject  to 

al^=Q.  (3.11) 

Problem  3.3  (LS:  Finite  Length  ARX)  Assume  ike  kypoikeses  in  Problem  3.2.  Let  N  be  a 
positive  integer,  and  find  sequences  a**  in  Y  and  a*“  in  U  to  minimize  7f:Ar(t***',a*“)  = 
subject  to  (3.11)  and 

al''=0,  ai“  =  0,  ^k>N.  (3.12) 

(The  functional  J£;,v(a*'',a'“)  is  just  j£;(a**',a*“)  defined  for  parameters  a**  and  a*“  that  satisfy 
(3.11)  and  (3.12).) 

Problem  3.4  (LS:  HR)  i4ssume  tke  kypoikeses  in  Problem  3.2.  Find  an  absolutely  summable 
sequence  a*”  in  U  to  minimize 

^£(0,a*“)  =  (3.13) 

Problem  3.5  (LS:  FIR)  Assume  ike  kypoikeses  in  Problem  3.2.  Let  N  be  a  positive  integer,  and 
find  an  absolutely  summable  sequence  o'"  in  U  to  minimize  y£jv(0,a'“)  =  J£(0, a'“)  subject  to 

al“  =  0,  \fk>N.  (3.14) 

In  Problems  3.2-3.5,  the  parameter  sequences  a**'  and  a'“  are  chosen  to  miriimize  the  norm  (i.e., 
I  ■  I)  of  the  prediction-error  sequence 

N  N  N  N 

e  =  ^Ai^q-'y-\-Y^Al'‘q~'u  -  s  =  ]^Ig"*y.on  +  “  »•  (3  15) 

i=l  1=0  t=l  1=0 

(Recall  (2.5)  and  (3.1)-(3.3)).  The  k^^  term  in  this  sequence  is 

N  N 

e*  =  i;(yt-.,a:»)-H53(«*_<,on-  k>0,  (3.16) 

•=i  »=o 

with  y,  =  0  and  =  0  for  i  <  0  (recall  (2.2)).  In  Problems  3.2  and  3.4,  =  oo;  in  Problems  3.4 

and  3.5,  a*»  =  0. 
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Problem  3.6  (LQR:  Infinite  Interval)  l^tX  be  a  Hilbert  space  andT  €  B{X,X),  B"  €  B{U,X), 
B*  6  B(W,X),  C*  €  B(  V,y),  C*  €  X\  Let  the  spectral  radius  ofT  be  less  than  1,  and  let  c"  be 
ike  element  of  X  that  satisfies 

C‘z  =  {x,c*‘)  Vx.  (3.17) 

(The  element  e"  is  independent  of  x.)  Find  a  square-summable  sequence  v  in  Y,  with  vo  =  0,  such 
that  V  mintmue*  ^ 

Jcly) = (318) 

•=i  (=1 

where  the  sequence  (  tn  X  satisfies 

6  =  -c‘*.  Vfc>l.  (3.19) 

Problem  3.7  (LQR:  Finite  Interval)  Assume  the  hypotheses  tn  Problem  3.6.  Find  a  sequence 
V  in  Y,  with 

1/0  =  0,  t/t  =  0  Vfc>N,  (3.20) 

5ttc/.  that  V  minimizes 

N  N 

JcN{y)  =  X;<B”'B“''^i,B“'*^i)  +  +  <G^n+i,:v+i)  (3.21) 

<=i  1=1 

where  the  sequence  (  tn  X  satisfies  (3.19)  and 

eo 

G  =  5^T'[B“'B“'B“'*  +  B“B“B“*)(T*)*.  (3.22) 

i=Q 

The  operator  G  in  (3.22)  is  the  unique  element  of  B(X,X)  that  satisfies  the  Lyapunov  equation 

rGT*-C;=-[B”'B“'B"'*  +  B“B"B“*].  (3.23) 


Theorem  3.8  Assume  the  hypotheses  in  Problems  3.2  and  3.6.  Assume  further  that,  for  h  >  1, 


LJ"  =  C7»r*-‘B“', 

(3.24) 

=  C"T^-^B'‘, 

LI'"  =  C*r*"‘B“'. 

(3.25) 

If  sequences  and  a*“  solve  Problem  3.2,  then  the  sequence  v  =  a**'  solves  Problem  3.6.  Con¬ 
versely,  if  a  sequence  v  solves  Problem  3.6,  then  the  sequence  a**  =  i/  and  the  sequence 

=  t>l,  a5“  =  c5“.  (3.26) 

solve  Problem  3.2.  (The  sequence  ^  is  generated  by  (3.19).^  If  the  autocorrelation  operator  LlK"  IV" 
is  coercive,  there  exists  exactly  one  sequence  v  that  solves  Problem  3.6.  If  u  satisfies  (2.23),  then, 
for  any  pair  of  sequences  a'*  and  a*“  that  solve  Problem  3.2,  (3.26)  holds  with  (  generated  by  (3.19) 
with  u  =  a**'. 

Theorem  3.9  i45Sttme  the  hypotheses  in  Problems  3.3  and  3.7,  assume  that  (3.24)  and  (3.25)  hold 
for  k  >1,  and  let  u  be  white.  If  sequences  a'*  and  a*"  solve  Problem  3.3,  then  the  sequence  v  =  a*<' 
solves  Problem  3.7.  Conversely,  if  a  sequence  v  solves  Problem  3.7,  then  the  sequence  a**'  =  v 
and  the  corresponding  sequence  a*“  given  by  (3.26)  solve  Problem  3.3.  If  IV"  L'jf"*  is  coercive, 
there  exists  exactly  one  sequence  v  that  solves  Problem  3.6.  //B”  is  coercive,  then,  for  any  pair  of 
sequences  a**  and  a*"  that  solve  Problem  3.3,  (3.26)  holds  with  (  generated  by  (3.19)  with  v  =  a'*'. 
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Theorem  3.10  A$$%mt  that  (3.24)  anJ  (3.25)  koU  /or  k  >  1.  If  ike  kypoikeoeo  in  Problem!  3.2 
3.6  koU,  iktn 

,  min  ,  Jb{v, a*”)  =  min  Jc(i/)  +  (H^C. cS")  +  (3-27) 

{v:V0sO}  y 

If  ike  hypotheses  tn  Problems  3.3  anrf  3.7  hold  ond  u  is  white,  then 

^  min  ,  JBs{y,  «*“)  =  min  Jcsi^')  +  (iT C.  O  + 1"!*-  (3-28) 

{vtOosO}  " 

Theorem  3.11  A  oobttion  to  Problem  3.4  u  a*"  =  e*".  If  u  oatisfies  (2.23),  <&e  solution  to 
Problem  3.4  is  enifue.  Ifuis  toltle,  a  solution  to  Problem  3.5  i« 

a{“  =  cj“.  0<t<Ar.  (3.29) 

If  u  is  white  with  A"  coercive,  the  solution  to  Problem  3.5  is  unique. 

Proof  of  Theorem  3.8  Under  the  hypotheses  in  Problem  3.2, 

JB(a**.a*“)  =  +  yl*“u  -  +  |v|*,  (3.30) 

=  I  [cr  -  ar  -  ^  irria:"]  ii*.  (3.31) 

•=0  k=0 

M"!,'"..-  = £(«•[.;■■  -  EirrM'i.w  -Ei?r;«:'i> 

isl  ksO  tsO 

+ (332) 

i=0 

Since  cj“,  cj*,  I«J“,  and  LJ®  are  related  as  in  (3.1),  (3.25)  implies 

cj“  =  B“*(r*)*-*c**,  cj”' =  A”'*(r*)*-*c**,  *>1.  (3.33) 

Since 

=  -(r)'-»c‘*  +  53(r  )‘->-*c«'Vt,  i  >  1,  (3.34) 

i=l 

it  follows  from  (3.24)  that,  if  i/q  =  0, 

cr  -  E  *■  >  1.  (3-35) 

t=o 

cr  -  E  ^  1-  (3.36) 

fc=0 

Hence,  if  i^)  =  0, 

j£(,/,a*“)  =  Jc(i/)  +  {ff“ctr,cn  + 

+lll«.c5“  -  aS“l  +  El«-«.(B“*€i  +  !?“**'<  +  oDll^  (3.37) 

•=i 
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That  V(l“  R“  being  coercive  is  sufficient  for  the  existence  of  exactly  one  solution  to  Problem  3.6 

is  well  known.  All  of  the  other  statements  in  the  theorem  follow  from  (3.37).  □ 

If  u  is  white  also,  then 

|A*»I»“u  +  A*“u-L*“u|*  = 

iii«,  cs“  -  «s“i + =  (3-38) 

i=l 

(ie“(c5“  -  o5“),  (cS“  -  aS“))  +  +  irui  +  an.  (B“*6  +  Lr>'i  +  an) 

i=l 

where  (  satisfies  (3.19)  with  1/  =  a‘*. 


Proof  of  Theorem  3.9  When  1/*  =  0  V4  >  (3.19)  yields 

(i  =  t  >  N  +  1.  (3.39) 

Then,  when  (3.11)  holds,  (3.37)  with  1/  =  a**  and  (3.38)  yield 

W  N 

JEN{a*\an  =  al» , 

t  =  l  $=1 

00 

+  53((B“'ii“'B"*  +  B“B“B“*]7-^A,+i,r*(w+i)  (3.40) 

i=0 

S 

+(B“(cS“  -  an,  (C  -  an)  +  +  ^rv.-  +  an,  +  Lr^'i  + 

isl 

+(B“'c5-,cr)+N^ 

i.e.,  when  (3.11)  holds  and  u  is  white. 


JBNia’\an  =  JcN(a*>')  +  (B”'cS”',c5“')  +  hf  +  (B“(c5“  -  a5“),(c5"  -  aj")) 
+  53(B“[B“*^<  +  Lnal»  +  oJ«],  [B“*e<  +  Lral>  +  a;"]). 


t=i 


□ 


(3.41) 


Proof  of  Theorem  3.10  From  (3.37)  and  (3.41).  □ 

Proof  of  Theorem  3.11  With  a**'  =  0,  (3.30)  reduces  to 

Jb{0,  an  =  |A*“«  -  +  |L*“'u;|*  +  |t;f  (3.42) 

=  iif;(4-‘«.(cr  -  «r)ii" + 

•=o 

When  u  is  white. 


N 


JBN{0,an  =  '£{ff‘{cr-an,{cV‘-an)+  £  (B“c:“,cr)  +  |B*“'u;p+H^  o  (3.43) 

<=0  i=N+l 
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4  Minimax  Estimation  and  Prediction 


FYom  here  on,  we  assume  Hypothesis  3.1,  the  hypotheses  of  Problem  3.6,  and  (3.24)  and  (3.25)  for 
ib  >  1.  We  also  assume  the  following  hypothesis. 

Hypothesis  4.1  There  exist  a  Hilbert  space  Z  and  sequences  £.“  €  B{U,Z)  and  LI*"  €  B(W,Z). 
The  sequence  Lq^uj  is  independent  ofLl'“w  and  L^w  VI  >  1,  and  L^w  ts  independent  of 
y/k  >  0.  T&e  sequence  u  is  white. 

We  define  operators  and  £>'*'  as  in  (3.3).  For  each  absolutely  summable  sequence  a“  €  Z, 
we  define  a  corresponding  sequence  of  linear  functionals  and  an  operator  A"  as  in  (3.1)-(3.3). 
This  section  concerns  the  linear  system  in  (3.8)  and  (3.9)  with  the  additional,  Z-valued  measurment 
sequence  z  satisfying 

z  =  L*'‘u  +  L*'“w.  (4.1) 

In  some  applications,  x  =  s. 

We  use  the  following  notation  for  finite  and  infinite  sequences 


•y  /.*» 


l:t  “*+l  “t+2 


...]. 


For  each  positive  integer  N,  we  define  the  prediction-error  sequence 


(4.2) 


N  N  N 

e  =  A^q~^u  +  -f  '^  A\fq~^z  —  s, 

k=0  k=l 

and  the  fit-to-data  criterion 

=  tef  -  7*i;  K  I*. 

ksl 

where  7  is  a  nonnegative  real  number. 

Definition  4.2  A  real-valued  function  f{y^ is  coercive  in  1/*  if 


(4.3) 


(4.4) 


(4.5) 


where  fo  is  a  fixed  real  number,  fi{v^ ,v'‘ ,1^*)  is  0  linear  function  of  (»/>',  i/“,«/*),  and  there  exists  a 
positive  real  number  p  such  that 

f2ii^,0,0)>py\\  (4.6) 

If  the  corresponding  condition  holds  for  j/"  or  t/* ,  then  f  is  coercive  in  j/“  or  v‘ ,  respectively. 


Problem  4.3  (Minimax:  Finite  Length  ARX)  Suppose  that  •/£7v(aj!'^,ag"^,aJf^)  is  coercive 
in  a\*ff  and  ajjy  and  that  —Jen  is  coercive  in  ojfyy.  Find  sequences  in  Y,  in  U,  and 
in  Z  that  satisfy  the  saddle-point  condition 

JBN(<ki^N,0'o“N>*'l:N)  ^  '^Bn(o{^n ,Oo“n ^  •  *'0;iV> “if/v)  (^-7) 

for  all  sequences  in  Z,  in  Y,  and  in  U. 


Next  we  define 

•^£jv,jv(oJjviaojv>®ifAr)  =  ^bn{<^un<°o"n>^i‘n)-  (^•®) 

For  1  <  ib  <  JV,  when  JenmiH^i^  ®i+i])  “  coercive  in  aJ|^j  and  aj“  j  and 

—JBN,k-x-i  is  coercive  in  aJ^j  (for  aJ!j,  aj!^,  and  ajf^  fixed),  we  define 

JBN,k{a[*k,ao^k,aV.k)  =  min  max  j£rAr.t+i([a;ft  i7»],K“t  »?*])•  (4-9) 

tf^^Y  *7  €46 
fl*€U 
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(According  to  (4.2),  =  (njf*  ol+ili  etc.)  We  also  define 


^BJV.o(0,aS“,0)  =  min  inax  j£;jv,j(i/>',[o5“r;“],»/*). 

n*tY  •)  €« 
tl'€V 


(4.10) 


Theorem  4.4  If  ./£Af,*+i([aI!*  <>*+1])  “  coercive  in  aJIJ.,  and  ojlj.,  and 

—l£jv,i+i  “  coercive  in  aJ^j  V  t  =  0, 1, . . .  ,.A^  —  1,  i&en  for  each  tuck  k  there  exist  hounded  linear 
operators  . ^N,k  »ec<ore  encA  that 


JEN.k{a\:k^<^M\)  =  J£N.*+i(Kf*  a'4J.{aS“*  ai;i]) 

(4.11) 

with 

®t+l  =  ^V/.k°‘V.k  +  +  RV,k°’Vk  +  ^*+1 . 

(4.12) 

at“  1  =  K'if.aW  +  +  d>U„ 

(4.13) 

ai;i  =  K’pf.aW  +  Kh-kayf,  +  +  d>Uu 

(4.14) 

<■0  =  0.  ajfo  =  0. 

(4.15) 

Furthermore,  for  each  k,  the  uj+i,  and  oJ^j  for  which  (4.11)  holds  are  unique. 

This  theorem  follows  from  the  assumed  coercivity  and  the  fact  that  Jbn  is  e  linear-quadratic  func¬ 
tional. 


Problem  4.5  (Order-recursive  Minimax:  Finite  Length  ARX)  Suppose  that 
JEN,k+i{[a{%  <»i+i].[“o“i:  “  coercive  in  and  oJ“j  and  -Jjsw.t+i  is  coercive 

in  aJ^^j  V  i  =  0, 1, . . . ,  iv  —  1.  Find  a'^pf  and  Oojv  such  that  aj“  minimizes  Jen,o(^>°o*>^) 

=  nVx-k  +  nvz  +  0  <  A  <  JV  -  1,  (4.16) 

°*+i  =  +  f(N>"k  +  <^Ui,  0<k<N-l,  (4.17) 

ajfo  =  0.  (4.18) 

Now  we  consider  the  optimal  control  problems  of  finding  open-loop  and  closed-loop  saddle-point 
strategies  for  the  performance  index 

Af 

»=i 

-  7*f;  ki*  (4.19) 

i=l  1=1 

where  G  is  the  operator  in  (3.22)  and  the  sequence  (  in  X  satisfies 

6  =  -c",  =  Vfc>l.  (4.20) 

We  use  the  standard  notions  of  saddle-point  strategies  for  linear-quadratic  games. 

Problem  4.6  (Open-loop  LQ  Game:  Finite  Interval)  Suppose  that  Jcn{>^ n,yI  n)  is 
coercive  in  i/f.jy  and  —JcN  is  coercive  in  j/f.jy.  Find  sequences  oj!'^  in  Y  and  a*f^  in  Z  that 
satisfy  the  saddle  point  condition 

S  •^CAr(oijv,Ojjyf)  <  Jcn{x^:n^°’V.n)  (4-21) 

for  all  sequences  in  Z  and  in  Y. 
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We  will  define  cloeed-loop  saddle-point  strategies  in  a  way  that  is  most  relevant  to  Problem  4.5, 
but  our  definition  here  is  equivalent  to  the  standard  definition  (see  [6],  for  example)  of  closed-loop 
saddle-point  strategies  for  (4.19)  and  (4.20).  We  define 

^CJV,Jv(*^;Af,*'J;N)  =  (4-22) 

For  I  <k  <  N,  when  “  coercive  in  and  -Jcs,k+i  ««  coercive 

in  (for  and  ^ed),  we  define 

JcN.ki'^i.kM.k)  =  m“*^civ.4+»(K:t  »?'])  (4.23) 

Theorem  4.7  If  JcNMi([‘^:k  “  coercive  in  and  -JcN,k+i  coercive  in 

V  fc  =  0, 1, . . . ,  N  —  1,  <Aen  for  each  tuck  k  there  exist  bounded  linear  operators  and 
such  that 

JcN.k(.t/[..k,vlk)  =  JcN.k-kl{H:k  Kt  *'t+i])  (4.24) 

with 

t'J+i  —  (4.25) 

li+i  =  -FtV,6+i-  (4.26) 

Also,  the  unique  sequences  a[*ff  and  a'jy  that  solve  Problem  4.6  are  aW  =  and  a*  jy  =  i/j jy 
with  and  i/^  jy  generated  by  (4.25)  and  (4.26). 

This  theorem  follows  from  the  assumed  coercivity  and  the  fact  that  Jcn  is  a  linear-quadratic  func¬ 
tional.  The  feedback  control  laws  in  (4.25)  and  (4.26)  are  the  pair  of  closed~loop  saddle-point  strate¬ 
gies  for  (4.19)  and  (4.20). 

Our  formulation  of  the  order-recursive  minimax  parameter-estimation  problem  and  the  closed- 
loop  saddle-point  strategies  for  the  control  problem  are  motivated  by  the  following  standard  results 
on  LQ  dynamic  games  [6].  There  exists  a  sequence  of  Riccati  operators  Pk,  generated  by  a  Riccati 
difference  equation  backward  from  the  final  condition  Pn+i  =  G,  such  that 

k 

JcNA^uyik) = -b  {Pk^iik+i,M 

i=l 

-h  v.*^> + {Li'^irLr^,ut)]  -  7*  ^  ki*  (4.27) 

•=1  i=l 

for  any  sequences  and  The  gain  operators  and  F^^^^  in  (4.25)  and  (4.26)  can  be 
constructed  from  Pt^.i  and  the  operators  in  (4.19)  and  (4.20).  See  Section  5.2.  The  standard  results 
on  optimal  control  do  not  appear  to  provide  an  algorithm  for  solving  the  parameter-estimation 
problem,  but  they  make  the  solutions  to  Problems  4.6  and  4.8  trivial. 

Problem  4.8  (Closed-loop  LQ  Game:  Finite  Interval)  Suppose  that  the  coercivity  hypotheses 
tn  Theorem  4.7  hold.  Find  the  sequence  i/f.jy  generated  by 

•'t  =  (4.28) 


and  (4.20)  with  i/f.jy  =  0. 

Theorem  4.9  Suppose  that  the  coercivity  hypotheses  tn  Problem  4.3  hold.  Then  the  coercivity  hy¬ 
potheses  in  Problem  4.6  hold,  and  the  finite  sequences  Ug^y  and  a'fjy  solve  Problem  4.3  if  and 
only  ifa{^ff  and  solve  Problem  4.6  and 

oJ“  =  -B“*e.-LrX’'-^o"*ar.  «>1.  a5“=c5“.  (4.29) 
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Thecwem  4.10  Suppose  that  ike  coerciviip  hypotheses  tn  Problem  4.5  hold.  Then  the  coercivity 
hypotheses  tn  Theorem  4.7  hold,  and  the  finite  sequences  a\^ff  and  agjy  solve  Problem  4.5  if  and 
only  if  =  ojfjy  solves  Problem  4.8  and 

ar  =  -B^*(i-Lraf>.  i>l,  a5“  =  cS“.  (4.30) 

Proofs  of  Theorems  4.0-4.16  In  (3.30)-(3.41),  we  include  terms  corresponding  to  the  measure¬ 
ment  z  in  (4.1)  to  obtain 

j£N(airjv.«o“N.«"N)  =  ^CN(aI!'yv.‘»ifiv)  +  («“(4“  -  a5“),(c5"  -  oS“)) 

s 

+  +  LS“*aJ»  -t-  oD,  -I-  +  LS“’a;*  +  oJ“])  (4.31) 

i=l 

+{R'“ctrAn+ivV 

where  •/cw(aiV>®*fv)  “  given  by  (4.19),  ^  satisfies  (4.20)  with  and  and 

(iJ"cJ“',  eg*")  and  |vp  are  constants  independent  of  oJVjy,,  and  ojf^.  The  following  lemma  then 
implies  Theorems  4.9  and  4.10.  The  subsequent  Theorems  4.15  and  4.16  follow  from  (4.31). 


Lemma  4.11  Let  Hi  (i  =  1,2, ...,5^  6e  Hilbert  spaces  with  Lij  €  B(Hi,Hj),  C4  €  H4,  C5  €  H^. 
Define  J(*,  •)  :  Hi  x  H3  —*  R.  and  J(-,  •)  :  Hi  x  H3  x  H3  — »  R.  iy 

•^(^ii^s)  =  +  ^43*3  +  C4I*  —  7*1^3!*.  (4-32) 

^21^3)  =  -/(hi,  A3)  +  |X>siAi  -1-  Aj  L53A3  +  C5I*,  (4.33) 

wAere  7  is  a  positive  real  number,  and  consider  the  saddle-point  conditions 

•^(AjiAa)  <  J(Ai,A3)  <  j(hi,h3)  V  Aj  €  Bj,  A3  G  B3,  (4-34) 

^‘(Ai.Aj.  Vi)  <  i(Ai,A2,/i3)  <  JihuhzM)  V  Ai  G  Hi,  A*  G  Bj,  A3  G  B3.  (4.35) 

Suppose  that  (4.35)  holds  for  some  (Aj,  A2,  A3)  G  Bi  x  B2  x  B3.  Then 

A2  =  — (LsiAi  -I-  £>53^3  -i-  cs),  (4.36) 

and  the  following  one-to-one  correspondence  exists  between  saddle  points  of  J  and  saddle  points  of 
J:  (4.34)  holds  for  some  (Aj,  A3)  G  Bi  x  B3  if  and  only  i/(4.35)  holds  for 

hi  =  hi,  h2  =  — (I>5iAi  -h  Lishz  -t-  cs),  A3  =  hi.  (4  37) 

This  lemma  follows  from  the  first-order  necessary  conditions  (normal  equations)  for  saddle  points  of 
linear-quadratic  functionals  and  the  fact  that  J  is  concave  in  A3  if  J  is. 


Theorems  4.9  rmd  4.10  say  that  Problems  4.3  and  4.5  are  equivalent,  respectively,  to  Problems  4.6 
and  4.8  when  7  is  large  enough  for  the  appropriate  objective  functionals  in  Problems  4.3  and  4.5  to 
be  coercively  concave  (i.e.,  —  J  coercive)  in  the  maximizing  parameters  ajfjy.  Because  of  the  term 
on  the  second  line  of  (4.31),  7  generally  must  be  larger  for  the  coercivity  hypotheses  in  Problems  4.3 
and  4.5  than  for  the  coercivity  hypotheses  in  Problems  4.6  and  4.8.  The  following  alternative 
minimax  parameter  estimation  problems  have  coercivity  hypotheses  that  are  equivalent  to  those  in 
the  corresponding  control  problems. 

We  define 


•^Fiv,Af(oijy,a*fjy)  —  •fBAr(fliV.fllfY)  —  n>in  j£;v(aJ!yy,ao:^,ajfj^).  (4.38) 

•OsN 
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Problem  4.12  (Minimax:  Finite  Length  ARX)  Suppose  that  •/£N(a'jv>°!f7v)  ^  coercive  in 
—Jen  w  coercive  in  Find  sequences  ajjy  in  V  and  a\‘^  in  Z  that  satisfy 

the  saddle-point  condition 


jEN{ay/^,t/yff)  <  jEN(ayff,ayf/)  <  /£/sr(»^jvi“ifAr) 

(4.39) 

for  all  sequences  in  Z  and  in  Y,  and  for  this  ay^  and  find  aj^y 

such  that 

jEN{ayf/,ayf,,)  =  JEN(ayf/,aQ*j,/,ayf/). 

(4.40) 

For  1  <  k  <  N,  when  JEN,k+i([ayi,  al+i].[alft  «  coercive  in 

coercive  in  aJ^j ,  we  define 

and  —JEN,k+i  is 

/£7v,*(a*»j,a‘i't)  =  imn  max  JBjv,t+ :([<**  »?*])• 

(4.41) 

Theorem  4.13  If  JEN,k+\{[ayi,  uj+i])  «s  coercive  in  and  —JEN,k+i  i*  coercive  in 

V  i:  =  0, 1, . . . ,  N—l,  then  for  each  such  k  there  exist  bounded  linear  operators  K^h, 

Kff  k  vectors  such  that 

jEN.k{a\\,a‘^,,)  =  JEN.k.,i{Kk  «i+i]) 

(4.42) 

with 

(4.43) 

®t+l  =  ^N,k°l\  +  +  ^t+l> 

(4.44) 

®*?o  ~  0,  Ojfo  =  0. 

(4.45) 

Furthermore,  for  each  k,  the  and  for  which  (4.42)  holds  are  unique. 


This  theorem,  like  Theorem  4.4,  follows  from  the  assumed  coercivity  and  the  fact  that  j£s  is  a 
linear-quadratic  functional. 

Problem  4.14  (Order-recursive  Minimax:  Finite  Length  ARX)  Suppose  that 
/£:Ar,k+i([aJf*  “t+il.laif*  ®*+il)  m  coercive  in  and  -JBN.k+i  «*  coercive  in 
'd  k  =  0,1, . . .  ,N  —  1.  Find  a\*ff  such  that 

<1  =  j<NVuk  +  ^Uv  0<k<N-l,  (4.46) 

al!'o  =  0.  (4.47) 

and  for  this  find  Ugjy  such  that 

•f£7v(aj!';v,0)  =  J£;Af(aI!W,ao“jViO)-  (4-4«) 

Theorem  4.15  The  coercivity  hypotheses  in  Problem  4.12  hold  if  and  only  if  the  coercivity  hypothe¬ 
ses  in  Probl  m  4.6  hold.  When  these  hypotheses  hold,  the  finite  sequences  Ogjv  nnd  ojfyy  solve 
Problem  4.12  if  and  only  if  a‘^fi  and  ajf^y  solve  Problem  4.6  and  Oqjv  •*  9iwn  by  (4.29). 

Theorem  4.16  The  coercivity  hypotheses  in  Problem  4.14  hold  if  and  only  if  the  coercivity  hy¬ 
potheses  in  Theorem  4.7  hold.  When  these  hypotheses  hold,  the  finite  sequences  and  solve 
Problem  4.14  if  and  only  if  solves  Problem  4.8  and  is  given  by  (4.30). 
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5  Relationships  to  State-Space  Filtering 

5.1  LQR/Kalman  Filter  Representations  of  LS  Parameters 

We  begin  with  the  finite-order  LS  problem  and  the  finite-time  control  and  filtering  problems.  Ac¬ 
cording  to  standard  results  on  LQR  problems,  the  solution  to  Problem  3.7  has  the  form 

Vk  —  —Ftik  (5.1) 

where 

F*  =  +  C»Pt+iC*’]-‘C*P*+ir  (5.2) 

and  the  Riccati  operators  Pt  are  the  nonnegative,  self-adjoint  elements  of  B(X,X)  satisfying 

p*  =  -I-  r(p*+i  -  p*+iC»*[ig“'^“'io"'*  +  c»Pt+iC**)-*c»Pt+,)r,  (5.3) 

Pn+1  =  G.  (5.4) 

The  corresponding  optimal  sequence  (  satisfies 

i<t<N-n,  (5.5) 

where 

fk  =  T-F;C>,  \<k<N,  fo  =  I.  (5.6) 

With  (5.1)  and  (5.5),  the  solution  to  Problem  3.3  given  in  Theorem  3.9  becomes 

aj'  =  P*Tr_ifj_2  •  •  TJc".  1  <  i  <  ^.  (5.7) 

l<k<N,  aS“  =  c5“.  (5.8) 

According  to  (3.1)  and  (3.17),  the  bounded  linear  functionals  correspondmg  to  the  a]/'  and  in 
(5.7)  and  (5.8)  are 

Al’'  =  C'fo  ■  ■■fk.2fk.1Fi,  l<k<N,  (5.9) 

AJ”  =  C*fb  l<k<N,  AJ"  =  LJ".  (5.10) 

Under  the  hypotheses  of  Theorems  3.8  and  3.9,  a  state-space  realization  of  the  input/output 
system  in  (3.8)  and  (3.9)  is 

x*+i  =  Txt  +  B'‘uk  -h  (5.11) 

=  (5.12) 

St  =  C**t  -I-  I5“ut  -I-  L’o'^wk  -1-  V.  (5.13) 

(Recall  that  u  is  a  known  input  sequence  while  w  and  v  are  unknown  noise  sequences.)  If  y  is  the 
measured  output,  a  K2dman  filter  for  one-step-ahead  prediction  of  zt+i  has  the  form 

ft+i  =  fkik  hvk  +  [P”  -  hLDxik,  (5.14) 

and  the  corresponding  one-step-ahead  prediction  of  st^.i  is 

«t+i  =  <^***+1  +  Lo“ut+i  =  (xt+i,c")  Lo“«i+i.  (5.15) 

(The  xt  here  often  is  denoted  by  xt/t-i*  See  [20],  for  example.)  If  the  expected  value  of  xi  is 
xi  =  0,  then 

Xyv+l  =  FnVN  +  [P“  -  FArL5“]«Af 
JV-l 

-I-  Y,  TNfN.l  •  ■  fN^kl-iih.iVN.i  +  [P“  -  Pyv-.Lr]«N-.).  (5.16) 
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If  the  probabilistic  error-covariance  operator  for  xi  is  G,  then 


Ft  =  i=’w+i_t,  n  =  fN+i-k,  l<k<N,  (5.17) 

where  Ft  and  Tt  are  the  operators  in  (5.1)-(5.6).  Substituting  (5.16)  into  (5.15)  and  using  (5.17), 
(5.7),  and  (5.8)  yield 

N  N 

es+i  =  y^XyN+i-i,a**)  +  ^(ujv+i_<,a*“) 

•=1  >=0 

N  N 

=  EA‘V+i-i  +  (5.18) 

<=1  >=0 

Thus  the  ARX  coefficients  that  minimize  the  least-squares  one-step-ahead  prediction  error  in 
Problem  3.3  are  the  coefficients  in  the  probabilistic  minimum-variance  prediction  of  is+i  based  on 
the  data  yi, y2i •  ••  VAf  &nd  ui,U2,...ujv  and  the  assumption  that  the  initial  state  vector  xi  has  zero 
mean  and  covariance  G,  given  by  (3.22).  This  G  is  indeed  correct  for  the  steady-state  statistics  of 
the  state  vector  xt  in  (5.11)  when  T  has  spectral  radius  less  than  1  and  and  wt  are  zero- mean 
stationary  white  noise  sequences  (in  the  probalistic  sense)  with  covari2mce  operators  IV*  and  /f", 
respectively. 

When  we  say  that  the  parameters  aj*  and  a*“,  or  equivalently  and  j4f“,  are  Markov  pa¬ 
rameters  for  a  state-space  filter,  we  mean  that  they  are  the  coefficients  in  a  prediction  formula  like 
(5.18). 

Now  we  turn  to  the  infinite-order  LS  problem  and  the  infinite-time  control  and  filtering  problems. 
The  solution  to  Problem  3.7  is  given  by  (5.1)-(5.6)  with  Ft,  Ft,  and  7*  independent  of  k.  Hence 


Ft  =  F,  Pk  =  P,  tk  =  f=T-F’a>,  (5.19) 

and  the  solution  to  Problem  3.2  given  in  Theorem  3.8  becomes 

ajy  ^  k>l,  (5.20) 

at"  =  V' -  *>1.  Oo“  =  cS“-  (5-21) 

For  the  corresponding  bounded  linear  functionals,  (5.9),  (5.10),  and  (5.17)  yield 

Al^  =  G*r<‘-‘)F,  ifc  >  1,  (5.22) 

=  t>l,  i4S“  =  LS".  (5.23) 


The  linear  functionals  and  A]^-'  in  (5.22)  and  (5.23)  should  be  recognized  easily  as  the  Markov 
parameters  for  the  steady-state  Kalman  filter  for  the  system  in  (5.11)-(5.13)  with  measured  output 
y  and  predicted  output  s. 
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5.2  LQ  G&me/Hoo  Filter  Representation  of  Minimax  Parameters 

According  to  standard  results  on  LQ  games  (see  [6]  for  example),  the  solution  to  Problems  4.6  and 
4.8  have  the  form 

(5.24) 

aY  =  =  -FUk,  (5.25) 

where 


=  [i2-i-cp*+icr)-‘cpt+ir*, 


R  = 


0 


i.  -i?i. 


(5.26) 

(5.27) 


0  -y^i) 

and  the  Riccati  operators  Pk  are  the  nonnegative,  self-adjoint  elements  of  B(X,X)  satisfying 

Ft  =  +T(Pk+i  -  Pt+,C*[/e-t-CP*+jC’]-'CPi+i)r*  (5.28) 


and  the  final  condition  (5.4).  The  corresponding  optimal  sequence  (  satisfies  (5.5)  with  (5.6)  replaced 
by 

fk=T-  Fjl'C  -  l<k<N,  To  =  /.  (5.29) 

The  coercivity  hypotheses  in  Theorem  4.7  hold  if  and  only  if  the  operators 
and  [727  -  7,J"'72“'LS“'’  -  C^Pt+iC**]  are  coercive  for  1  <  ib  <  A^. 

With  (5.24),  (5.25),  (5.5),  and  (5.29),  the  solution  to  Problems  4.3  and  4.12  given  in  Theorems  4.9 
and  4.15  becomes 


l<k<N, 

(5.30) 

aY  =  ■  •  fo*c"  -  ir'a’k*  -  -to-ar . 

l<k<N,  o*o"=c*o“. 

(5.31) 

-$t  _  riT**  T** 

Ok  —  r**  ■  ioC  , 

The  corresponding  bounded  linear  functionds  are 

l<k<N. 

(5.32) 

AY  =  C’fo  -fk-7fk-iFr, 

l<k<N, 

(5.33) 

AY  =  C'fo  •  ■  ■fk-2fk-iB'‘  -  AYir  -  AYLY 

l<k<N,  AY  =  LY, 

(5.34) 

AY  =  C’fo  -fk.7fk-iFr, 

\<k<N. 

(5.35) 

By  the  standard  results  on  LQ  games,  the  operators  F*  in  Theorem  4.7  and  Problem  4.8  are  those 
in  (5.25)-(5.29).  Hence,  by  Theorems  4.10  and  4.16,  the  sequences  and  in  the  solution  to 
Problems  4.5  and  4.14  are  given  by  (5.30)  and  (5.31)  with  ajfyy,  =  0  and 

fk  =  T-  Fl'CP,  l<k<N,  fo  =  I.  (5.36) 

The  role  of  LQ  g2Lme  theory  in  /7oo  control  and  filtering  stems  from  the  following  well  known 
result,  which  is  a  corollary  to  Theorem  4.7. 

Corollary  5.1  Suppose  that  the  coercivity  hypotheses  in  Theorem  4.7  hold,  that  is  generated 
by  the  minimizing  closed-loop  saddle-point  strategy  in  (4.25)  and  (5.24),  and  that  =  0.  Then 

•fcN('^:jV>*'l;Jv)  S  0  ^ ‘'In-  (5.37) 
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To  interpret  (5-37)  in  the  most  useful  way  for  this  paper,  we  assume  that  G  can  be  factored  as 


G=BB* 


where  B  is  a.  bounded  linear  c^erator  from  a  Hilbert  space  X  to  X ,  and  we  define 

i = B'is+i 

and 

vk  =  +  irX).  1  <  *  < 

where  is  the  nonnegative  self-adjoint  square  root  of  R'" .  Then, 

i=l  1=1  t=l 

If  there  exists  a  real  number  p  such  that 

Lzw  oti/  T  Ziv*  v.  ^  ^ 

0  R  Lo  >  p,  1>  P, 

then  (5.37)  yields 

Yi  ^  Y, 

i=l  ♦=! 


(5.38) 

(5.39) 

(5.40) 

(5.41) 

(5.42) 

(5.43) 


Under  the  hypotheses  of  Section  4,  a  state-space  realization  of  the  input/output  system  in  (3.8), 
(3.9),  and  (4.1)  is  (5.11)-(5.13)  with  the  additional  output  equation 

zt=C^i*-H5“ut-bLru't-  (5-44) 

If  y  is  the  measured  output,  a  finite- time  Hoo  filter  for  one-step-ahead  prediction  of  a:*+i  has  the 
form  (5.14)  and  the  corresponding  one-step-ahead  prediction  of  «t+j  is  given  by  (5.15),  but  the 
Kalman  and  Hoo  filters  use  different  operators  T*  and  F*. 

For  the  Hoo  filter  that  is  related  to  the  order-recursive  minimax  parEuneter  estimation  problems 
in  Section  4,  the  operators  Ti  and  F*  are  given  by  (5.17)  with  F*  euid  T*  generated  by  (5.26)- 
(5.28)  and  (5.36).  That  these  operators  Ft  and  Tt  indeed  yield  an  Hoo  filter  will  be  established  in 
Theorem  5.2.  But  whatever  the  meaning  of  the  state-space  filter  in  (5.14)  and  (5.15)  for  this  choice 
of  Ft  and  T*,  the  one-step-ahead  prediction  of  sn+i  by  this  filter,  when  i\  =  0,  can  be  written  as 
in  (5.18)  with  the  coefficients  a*X  and  a*“  given  by  (5.30)  and  (5.31)  with  =  0  and  ft  given  by 
(5.36) — and  these  are  the  same  parameters  a'^  and  a*“  that  solve  Problems  4.5  and  4.14. 


Theorem  5.2  Suppose  that  the  coercivHy  hypotheses  in  Theorem  4.7  hold  and  that  (5.42)  holds. 
Assume 

xi  =  Bx,  Xi  =  Bx,  (5.45) 

tet  =  F“'/^«)t,  l<k<N,  (5.46) 

with  x,x  E.  X  and  wt  €  W.  Let  X2-.n  generated  by  (5.14)  with  ft  and  Ft  generated  by  (5.17), 
(5.26)-(5.28),  and  (5.36),  and  define  the  prediction  of  z  by 

it  =  C*xt  +  FS“tn,  l<k<N.  (5.47) 

Then 

X]  lit  -  X*  1^  <  (t*  -  P^)(  ^  l«i'*  1^  +  I*  -  •  (5  48) 

1=1  t=i 
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Theorem  5.2  follows  from  (5.43)  &nd  the  following  lemma. 


Lemma  5.3  (Duality  between  Control  and  State  Estimation)  Ltt  X,  W ,  Z,  and  X  bt 
Hilbert  spaces  with  5*  €  B{X,X),  B*  €  B{Z,X).  Ct  €  B{X,W),  D*  €  B{Z,W)  (\<k<N)  and 
CeB{W,Z).  Define 

Bk  =  C!v+i-*i  Ck  = 

Dk  —  ~  ^Af+i-*>  (5.49) 

l<k<N. 

Consider  the  two  systems 

^*+1  =  Skik  +  BkVk,  1  < 

TIk  =  Ckik  +  Dki'k,  1  <  t  <  ^.  (5.50) 

i  =  c^N+i,  =  0, 

and 


U+i  =  Skik  +  Bktik,  l<k<N, 
i'k=Ck^k  +  Dkrtk,  l<k<N,  (5.51) 

=  crl 

(The  ^k,  (,  Vk,  Vk  >n  (5.50)  have  no  relation  to  the  r)k,  Vk  m  (5.51),  except  for  belonging  to 
the  same  respective  spaces.)  Let  L  6  Bi^Z  x  Z  x  x  Z,W  x  W  x  ■■■  x  W  x  X)  be  the  operator 
such  that,  for  the  system  in  (5.50), 

(mN,0  =  Luis  (5.52) 

and  let  L  €  B{W  xW  x  xW  x  X,Z  x  Z  x  x  Z)  be  the  operator  such  that,  for  the  system 
in  (5.51), 

vi.N  —  (5.53) 

Also,  let  F'  and  /"  be  the  isometric  isomorphisms  such  that 

/’’(['ll  'IJ  •  •  •  ml  0  =  {[m  •'72  'll],  0,  /■'["i  •'7  ■  ■  ■  >'n]=Wn  ■  ■  ■  I'll  (5.54) 


Then 


L  =  rL^P. 


(5.55) 


Calculation  of  the  adjoint  operator  L*  yields  Lemma  5  3. 


Proof  of  Theorem  5.2  Let  (5.50)  be  the  closed-loop  control  system  in  the  closed-loop  LQ  game 
(Problem  4.8)  and  let  (5.51)  be  the  prediction-error  system  for  the  filter  specified  in  Theorem  5.2. 
Then  (5.48)  follows  from  (5.43)  and  the  fact  that  L  and  L*  have  the  same  norm.  (Related  arguments 
appear  to  be  used  in  [6].)  □ 
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6  Problems  with  Finite  Amounts  of  Data:  Convergence 

issume  Hypothesis  3  1,  the  hypotheses  of  Problem  3.6,  (3.24)  and  (3.25)  for  I;  >  1,  and  Hypoth¬ 
esis  4.1.  In  this  section,  we  assume  also  that  the  input  space  U  and  the  output  spaces  V  and  Z  are 
finite-dimensional.  We  do  no<  assume  that  the  noise  space  IV  or  the  state  space  X  introduced  in 
Problem  3.6  are  finite-dimensional. 


6.1  Least-Squares  Estimation  and  Prediction 

For  each  t  >  0,  we  define 


where 


with  Pi  =  0  and  Uj  =  0  for  «  <  0.  In  this  section,  we  write  ./£A?(aJ|/v> instead  of  j£;sr(a*‘', <J*“) 
for  the  objective  functional  in  Problem  3.3. 

Problem  6.1  For  f  >  0,  find  aj*^(t)  and  aojv(0  io  minimize  ,  aJIW) 

Theorem  6.2  If  Jssio** •«  Problem  3.3  is  coercive  in  and  ,  then 

ts  coercitie  in  and  for  sufficiently  large  t.  In  this  case,  if  a'ypj  and  solve  Problem  3.3 

and  aj!'^(t)  and  ao!‘^(t)  solve  Problem  6.1,  fAen 

=  floV •  (6.3) 


The  proof  of  Theorem  6.2  is  a  special  case  of  the  proof  of  Theorem  6.5. 

Because  of  the  finite  dimensionality  assumed  in  this  section,  all  norms  for  and  are 
equivalent,  so  that  the  limits  in  (6.3)  and  all  other  limits  in  this  section  are  unambiguous. 


6.2  Minimax  Estimation  and  Prediction 

For  each  t  >  0,  we  define  right-hand  side  of  (6.1)  with 

N  N  N 

M<)  =  (6-4) 

»=0  1=1  <=1 

with  Pi  =  0,  Zi  =  0,  and  Ui  =  0  for  i  <  0.  Abo,  we  define  /^^(aJjvr.Ojfyv)  *n  (4.38)  with 
lEN{a'yj^,ao:N,a[‘ff)  replaced  by  JEN{aW,ao:N 

Problem  6.3  Problem  4.3  ivith  JEN{aW,ay‘ff  replaced  by  J£^(oIlAr.Oo“Af>“*fw)- 

Problem  6.4  Problem  4.12  with  Jen{oi*N'^i'.n)  ^pl^f^^i  h  ^bn(^i!n'°Vn) 
JEiv(ailV>®o’Af'®ljv)  by 

Theorem  6.5  If  the  coercivity  hypotheses  in  Problem  4.3  hold,  then  the  coercivity  hypotheses  in 
Problem  6.3  hold  for  sufficiently  large  t.  In  this  case,  if  OoIati  ®*fAf  *<’ft>e  Problem  4.3  and 
®o!iv(0>  ®ifAr(0  Problem  6.3,  lAen 

lim  a',»^(t)  =  a\*f„ ,  Urn  =  Oojv.  a*if;ir(t)  =  • 

I— *00  1-^00  I— ♦00 

Similarly  for  Problems  4.12  and  6.4. 


(6.5) 


Proof  Problems  4.3  and  6.3:  Since  U,  Y,  and  Z  are  finite-dimensional,  oJ“yv> 
be  represented,  respectively,  by  finite-dimensional  column  vectors  a“,  a*',  and  o*,  all  of  which  can 
be  included  in  a  parameter  vector  a  €  C",  where  n  is  the  total  number  of  scalar  parameters  in 
a^ff,  and  a{*s-  Then 

^£JvKfN.«OJV.a*fA^)  =  <<?«.<»)  +  2«e{/?,o)  -|-  J£w(0,0,0), 

=  {Q(t)a,a)+2tkt{m.0‘)  +  (6-6) 

where  P,0it)  €  C”,  Q  and  ^(t)  are  n  x  n  matrices  and  {■,  )  is  the  Euclidean  inner  product  on  C". 
The  coercivity  hypotheses  in  the  parameter-estimation  problems  mean  that  certain  submatrices  of 
Q  and  Q(t)  must  be  either  positive  definite  or  negative  definite.  When  these  conditions  hold,  Q  and 
Q{t)  are  nonsingular  and  the  solutions  to  Problems  4.3  and  6.3  are  (for  sufficiently  large  t) 

a  = 

a(t)  =  -Q-^(t)l3(t)- 

It  follows  from  the  definitions  in  Section  2,  the  operator  definitions  at  the  beginning  of  Section  3, 
and  the  definitions  of  elements  of  Q  and 

0  are  the  limits  of  the  corresponding  elements  of  Q(t)  and  /?(<).  Therefore,  when  Q  is  nonsingular, 

limt_oo  =  Q~^  and  lim,_oo  «(<)  =  “■ 

Essentially  the  same  argument  works  for  Problems  4.12  and  6.4.  Q 

For  1  <  fc  <  JV,  we  define  J^ENM-N’^o-syi-N)  ^  (4.8)-(4.10)  but  with  Jen  replaced  by 

in  (4.8).  Also,  we  define  “  in  (4.38)  and  (4.41)  with  Jen  replaced  by  4*^  in 

(4.38).  Theorems  4.4  and  4.13  hold  with  JEN,k  and  JEN,k  replaced  by  Jsl/^k  4*^  *,  respectively. 

Problem  6.6  Problem  4.5  with  JenM^’^uN'^o^N'^vn)  replaced  by  JEN,ki'^v.N>^o‘N>‘^vN)  •** 
Theorem  4.4  and  Problem  4.5. 

Problem  6.7  Problem  4.14  with  Jen ,k((^uN >*^v.n)  replaced  by  JEN,ki‘^uN'°vN)  Theorem  4.13 
and  Problem  4.14  and  •f£:N(®*jvi®o-JVtO)  replaced  by 

Theorem  6.8  If  the  coercivity  hypotheses  in  Problem  4.5  hold,  then  the  coercivity  hypotheses  in 
Problem  6  6  hold  for  sufficiently  large  t.  In  this  case,  if  ®o!W  Problem  4.5  and 

OyNi*)  <*o“jv(0  Problem  6  6,  then 

Urn  a;r;,(f)  =  Urn  o5“;,(f)  =  aJIV-  (6-8) 

<—♦00  <*^oo 

Similarly  for  Problems  4.14  and  6.7. 


Proof  Problems  4.5  and  6.6:  For  1  <  fc  <  N,  all  of  the  parameters  in  a'*^,  oj"^,  and  ajfj  can  be 
represented  by  a  column  vector  a*,  and 


=  (<?*«*.«*)  +  2  Wt,a*)  +  /syv.t  (0,0,0), 

+  2»e(/?t(0,ot)  +  4N.k(0>0,0). 


(6.9) 


The  matrix  Qt  and  the  column  vector  /?*  are  obtained  by  a  finite  number  of  matrix  multiplications, 
additions,  and  inversions  starting  with  submatrices  of  the  Q  and  0  in  (6.6),  and  similarly  for  Qt(t) 
and  /?t(<).  Hence  Qt(t)  and  /?t(f)  are  continuous  functions  of  Q{t)  and  Sit),  so  that  Qt(<)  and  /?*(<) 
converge  to  Q*  and  Sk  as  Q(0  and  ^(f)  converge  to  Q  and  S-  H  follows  that,  for  each  k,  the  solution 
to  the  linear-quadratice  minimax  problem  on  the  right-hand  side  of  (4.9),  with  Jenmi  replaced  by 
4n  t+i>  converges  to  the  solution  of  the  original  minimax  problem  on  the  right-hand  side  of  (4.9). 

Essentially  the  same  argument  works  for  Problems  4.14  and  6.7.  O 
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Conclusions 


From  the  way  we  use  norms  and  inner  products  of  equivalence  classes  of  scalar  sequences  in  defining 
and  analyzing  the  parameter-estimation  problems,  it  follows  that  solutions  to  any  of  the  parameter- 
estimation  problems  in  Sections  3  and  4  and  the  asymptotic  values  of  the  estimated  parameters  in 
any  of  the  problems  in  Section  6  depend  only  on  the  equivalence  classes  to  which  u,  w,  t,  y,  and  r 
belong.  Therefore,  the  asymptotic  parameter  estimates  are  unaffected  by  changing  a  finite  number 
of  terms  in  any  of  the  measurement  sequences  or  the  known  input  sequence  u  or  by  changing  the 
initial  state  vector  of  the  plant,  since  the  plant  is  assumed  to  be  exponentially  stable.  This  is  to  be 
expected,  and  the  fact  that  it  is  an  immediate  consequence  of  our  formulation  suggests  that  Hilbert 
spaces  containing  equivalence  classes  of  deterministic  scalar  sequences  provide  a  natural  setting  for 
many  parameter-estimation  problems. 

The  class  of  least-squares  problems  in  Section  3  includes  many  common  problems  in  parameter 
estimation  and  adaptive  filtering  and  prediction,  but  to  keep  the  discussion  manageable,  we  have 
omitted  several  classes  of  problems  that  can  be  analyzed  by  straightforward  generalizations  of  the 
methods  here.  For  example,  the  results  in  Sections  3-6  can  be  generalized  to  problems  with  correlated 
process  and  sensor  noise  and  problems  in  which  the  summations  over  the  y  terms  and  u  terms  in 
the  ARX  model  have  different  limits  Ny  and  instead  of  the  same  limit  N. 

The  results  in  Sections  3-5  characterize  fully  the  asymptotic  values  of  least-squares  or  minimax 
parameter  estimates,  but  there  are  only  two  possibilities  for  computing  these  limiting  values  for  a 
particular  problem:  (1)  solution  of  the  parameter-estimation  problem  for  long  input/output  data 
sequences,  (2)  solution  of  either  the  corresponding  state-space  control  problem  or  the  corresponding 
state-space  filtering  problem,  either  of  which  requires  complete  knowledge  of  the  plant  and  noise 
statistics.  Of  course,  if  the  information  required  for  (2)  is  known,  then  there  is  no  reason  for 
parameter  estimation. 

On  the  other  hand,  it  often  occurs  that  some  rough  estimates  of  plant  and  noise  characteristics 
are  known  but  the  a  priori  information  is  not  sufficiently  accurate  for  designing  a  filter  or  perhaps  a 
controller.  In  this  case,  the  characterizations  in  this  paper  can  be  used  to  compute  rough  indications 
of  the  results  that  will  be  obtained  with  the  different  parameter-estimation  problems  and  for  dif¬ 
ferent  ARX  orders.  For  example,  comparing  Markov  parameters  for  steady-state  and  finite-interval 
Kalman  filters  computed  for  a  priori  models  of  the  plant  and  noise  should  indicate  how  large  a 
finite-dimensional  ARX  model  must  be  used  in  LS  identification  if  the  estimated  parameters  are  to 
approximate  the  Markov  parameters  of  the  steady-state  Kalman  filter  for  the  true  plant  and  noise. 
Such  a  priori  comparison  should  be  useful  in  applying  the  OKID  method  for  system  identification 
(3,  4,  5].  With  a  priori  plant  and  noise  models,  (3.27)  and  (3.28)  can  be  used  to  compare  roughly 
the  performance  levels  to  be  expected  of  adaptive  LS  filters  of  different  orders,  since  the  optimsd 
values  of  the  performance  indices  in  the  control  problems  are  easy  to  compute  with  Riccati  matrices. 
Similar  a  priori  comparisons  between  LS  and  minim^LX  parameter  estimation  and  filtering  can  be 
obtained  with  (3.37)  and  (3.41).  Also,  the  control  problems  in  Section  4  can  be  used  for  a  priori 
models  of  the  plant  and  noise  to  get  rough  a  priori  lower  bounds  for  the  values  of  7  that  can  used 
in  minimax  parameter  estimation  and  filtering  for  the  true  plant. 
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